Vectors effective for the expression and secretion of heterologous genes in streptomycetes are disclosed. Such
vectors comprise a plasmid replicable in Streptomyces, which comprises a Streptomyces promoter and a DNA sequence
encoding a Streptomyces signal sequence operably linked to, and under control of, the promoter. The DNA sequence
encoding the Streptomyces signal sequence may be operably linked to a heterologous gene encoding a desired protein so that
the Streptomyces signal sequence and heterologous protein are expressed, and the desired protein is secreted under control
of the Streptomyces signal sequence. Various Streptomyces promoters and secretion signal sequences may be used in the
invention, and DNA sequences comprising the promoters operably linked to DNA sequences encoding Streptomyces signal
sequences are disclosed as part of the invention. Vectors comprising the Streptomyces aph, ermE and ermE1a promoters and
modifications thereof, controlling the expression of the amy and ORF438 signal sequences, which lead to secretion of desired
heterologous proteins when transformed into Streptomyces hosts, are exemplified.

Expression of heterologous genes in Streptomyces species.

This invention comprises novel recombinant DNA cloning
vectors suitable for the expression and secretion of heterologous
proteins in Streptomyces, Streptomyces hosts transformed with such
vectors, and methods for the secreted production of heterologous
proteins in Streptomyces.

Most streptomycetes are soil microorganisms. They synthesize
secreted proteins and release them
into the surrounding environment. Up to the present, the genus Bacillus
has attracted most of the interest as a Gram-positive host system for
the secretion of heterologous proteins using recombinant DNA
methods. However, even when grown to high density, transformed
Bacilli rarely produce more than 10 µg/ml of secreted
heterologous mammalian proteins. Streptomyces, many strains of which
are known in the pharmaceutical arts as the source of antibiotics, are
an attractive host system for secreted expression of protein because
they are known to secrete antibiotics.

In general, a number of cloning vectors for use in
Streptomyces are known. In particular, U.S. Patent Nos. 4,513,085 and
4,513,085 disclose selectable recombinant DNA cloning vectors for use
in Streptomyces. Generally, the foregoing patents describe
recombinant DNA cloning vectors which comprise an origin of
replication on a restriction fragment of the plasmid, together with
one or more DNA segments that confer resistance to antibiotics. The
resistance to antibiotics may be used as a selection marker by which
transformants of the plasmids may be selected.

Neither of the patents, however, discloses the isolation of
functional promoters or secretion signal sequences that may be used
either alone or together to express or secrete particular heterologous
proteins on a recombinant plasmid suitable for use in Streptomyces.
European Patent Application (EPA) 84303755.7, published July 17, 1985,
describes recombinant plasmids suitable for expressing foreign DNA
sequences in Streptomyces in which the plasmid is characterized by a
promoter of the Streptomyces aminoglycoside phosphotransferases. The
expression of the eucaryotic protein bovine growth hormone as a
non-secreted product in Streptomyces is described. The patent application
asserts that the methods of the invention enable production and
excretion of products in Streptomyces through recombinant DNA
technology. In addition, there is a general description of an
embodiment of an expression plasmid in which DNA encoding a signal
sequence of a polypeptide or protein normally secreted from a
Streptomyces host employed in the expression system is operatively
linked to the Streptomyces aminoglycoside phosphotransferase (aph)
promoter, which is used as the expression control system, to achieve
the predicted production of a putative fusion protein. The signal
sequence is predicted to effect the transport of the fusion
protein through the Streptomyces cell wall and the maturation of the
desired protein by cleavage of the signal sequence. The disclosure,
however, lacks the description of any suitable Streptomyces or other
secretion signal and, furthermore, fails to show a single example in
which such secreted expression is accomplished.

EPA 84303755.7 does, however, describe Streptomyces lividans
66 containing plasmid pIJ703, which was deposited on June 6, 1983 in
the American Type Culture Collection under Accession No. 39378.
pIJ703 is identical to pIJ702, which is described further hereinbelow,
except that the fragments containing the gene coding for the tyrosinase
determinant (mel) and the gene coding for thiostrepton resistance
(tsr) are transposed as described in Katz et al., J. Gen. Microbiol.
129:2703-14 (1983).

Gray et al., Gene 32:22-30 (1984) disclose the production
of bovine growth hormone using a DNA sequence that, in the authors'
estimation, probably contains the control regions of the S. fradiae
aph gene, which leads to the intracellular accumulation of a product
that is shown to be bovine growth hormone by immunoblotting and
radioimmunoassay. The aph promoter in plasmid pEC23 appears to be
the same as that found in pEC24 of the previous reference.



WO 88/07079 



PCT/US88/00374

Kieser et al., in Mol. Gen. Genet. 185:223-238 (1982),
disclose a number of broad host range multicopy plasmids suitable for
use as cloning vectors in Streptomyces. In particular, plasmid pIJ101
is disclosed. Katz et al. disclosed a number of plasmids derived from
pIJ101 and in particular disclosed the construction of pIJ702, which
has two markers, tsr and mel. pIJ702 is the same plasmid as pIJ703,
ATCC #39378, except that the BclI fragment containing the tsr and mel
genes is oriented in the opposite direction.

A number of Streptomyces signal peptide sequences are
reported in the literature. The signal sequence peptides listed in
Table I appear to be functional for the secretion of the homologous
protein to which they are attached prior to secretion in nature. In
general, these Streptomyces signal peptide sequences have a typical
structure for signal peptides. This general structure is
characterized by a short stretch of N-terminal residues containing
several positively charged residues, followed by a segment of neutral,
mostly hydrophobic residues followed, frequently, by a proline residue
at the -4 to -7 position. The signal peptidase usually cleaves after
the Ala-x-Ala sequence at the -3 to -1 location, wherein x at -2 is an
amino acid that may vary. In the signal sequences identified in Table
I, for example, x may be proline, serine, histidine, arginine, or
alanine. The +1 position is taken as the amino terminal residue of
the mature protein.
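
The structural features described above can be sketched as a small Python check. This is only an illustration under stated assumptions: the function name, the eight-residue window taken as the N-terminal region, and the example peptide are all hypothetical and are not taken from Table I.

```python
def signal_peptide_features(signal, n_region_len=8):
    """Summarize the features described above for a signal peptide.

    `signal` is the one-letter sequence ending at position -1, i.e. the
    residue just before the mature protein. `n_region_len` is an assumed
    window size for the N-terminal region, not a value from the text.
    """
    if len(signal) < 7:
        raise ValueError("signal peptide too short to inspect positions -7 to -1")
    n_region = signal[:n_region_len]
    return {
        # positively charged residues (Lys, Arg) near the N-terminus
        "positive_in_n_region": sum(residue in "KR" for residue in n_region),
        # Ala at -3 and Ala at -1, with a variable residue x at -2
        "ala_x_ala": signal[-3] == "A" and signal[-1] == "A",
        "x_residue": signal[-2],
        # proline somewhere in positions -7 to -4
        "proline_at_minus_7_to_4": "P" in signal[-7:-3],
    }


# Hypothetical peptide built only to exercise the checks.
example = "MKRTSRLAALLAVLGLALPSVAASA"
features = signal_peptide_features(example)
```

A peptide that fails the Ala-x-Ala check may still be a working signal sequence, since the text says only that the peptidase "usually" cleaves after that motif.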

Streptomyces offer a number of advantages for the production
of recombinant proteins, and it would therefore be desirable to
construct vectors suitable for secreting heterologous gene products in
Streptomyces. Among the desirable characteristics of Streptomyces are
the following: they are in general non-toxic and do not cause disease
in man or animals. Furthermore, Streptomyces have been used in
industrial fermentation processes to produce antibiotics for many
years, and as a result there is considerable knowledge and experience
in large scale Streptomyces fermentations. Indeed, the economics of
production of proteins with Streptomyces are so favorable that enzymes
useful in food production may be produced as commodities using
Streptomyces industrial fermentations and purifications. Streptomyces
may be grown to high culture densities and will actively synthesize
and secrete homologous products for several weeks after the completion
of agitated growth. Despite the clear advantages of the use of
Streptomyces for the production of recombinantly produced proteins,
there has not heretofore been reported an expression vector capable
of secreting heterologous proteins of eukaryotic, particularly
mammalian, viral or prokaryotic origin in Streptomyces.

In one aspect, the invention concerns recombinant vectors
suitable for producing and secreting heterologous proteins in
Streptomyces. In general, these recombinant secretion vectors
comprise a plasmid replicable in Streptomyces which further comprises
a promoter functional in Streptomyces and a DNA sequence encoding a
secretion signal sequence that is operable in Streptomyces and may be
operably linked to a heterologous gene under the control of said
promoter. In a preferred embodiment, the DNA sequence encoding the
secretion signal sequence further includes a site for insertion of a
heterologous gene to produce a protein that is processed by the
secretion signal. In general, high level promoters, preferably from
Streptomyces organisms, that control the initiation of many mRNA
transcripts of the gene which they control are preferred. Among the
promoters that may be used in the secretion vectors according to the
invention are the promoter of the aminoglycoside phosphotransferase
gene (aph promoter), the ermE promoter, and modified versions of the
ermE promoter which are described in further detail hereinbelow. The
secretion signal sequences that may be used in the secretion vectors
according to the invention are engineered from proteins that are
secreted by Streptomyces and include the secretion signal sequence of
the α-amylase gene of Streptomyces limosus (amy signal sequence) and
the signal sequence of the ORF438 gene (ORF438 signal sequence).
Other exemplary Streptomyces signal sequences that may be useful in
the vectors according to the invention include those found in Table I.

In another aspect, the invention relates to a recombinant DNA
sequence comprising a high level promoter operable in Streptomyces
which is operably linked to a secretion signal sequence operable in
Streptomyces. In a preferred embodiment, the secretion signal
sequence will have at least one site for the insertion of a DNA
sequence encoding a desired heterologous protein. The insertion site
is positioned such that the signal sequence and heterologous gene
product, when translated in a Streptomyces host, produce a protein
that is processed, i.e., cleaved from the secretion signal sequence,
and preferably secreted by the Streptomyces host.

In another aspect, the invention relates to
the foregoing secretion vectors into which a DNA
sequence foreign to Streptomyces has been ligated so that a protein
comprising the secretion signal sequence fused to the
heterologous protein is produced and secreted.

In yet another aspect, the invention concerns Streptomyces
transformed with the foregoing secretion vectors, which transformed
Streptomyces secrete the product encoded by the foreign DNA sequence.

The invention will be better understood in connection with 
the following figures. 

Figure 1 is the sequence of the ermE promoter as determined
by Bibb et al., Gene 38:215-226 (1985); Gene 41:E357-E368 (1986).
The "TGG" sequence removed by site-specific mutagenesis to produce
ermE1a is overlined with the triangle.

Figure 2 is the sequence of the aph promoter as determined
by Bibb et al., Molec. Gen. Genet. 199:26-36 (1985).

Figure 3A is a composite restriction map of the fragments
containing the amy gene, which includes the 2.3-kb SstI-EcoRI fragment from
pIJ702.No5SL and the 2.25-kb KpnI-KpnI fragment from pIJ2921.5L.

Figure 3B shows the position relative to Figure 3A of the
transcription start site (wavy line), the direction of transcription
(arrow), and the coding sequences for the signal peptide (hatched box) and
the mature amylase (open box).

Figure 3C shows the amy gene fragment (thin line) in plasmid
pSYC1388. Sequence derived from pIJ702 is shown as a wavy line.

Figure 4 shows the nucleotide sequence of the entire amy
gene from S. limosus. The upstream open reading frame (ORF) protein
is translated. The inverted repeat sequence is marked above by
>>><<<. The transcription start site is identified by ! and the "-10"
and "-35" regions in the promoter are identified by overlining. The
deduced amino acid sequence of the putative preamylase protein is
numbered from the initiation methionine. The arrow marks the junction
of the amylase signal peptide and mature protein sequences.

Figure 5 is a schematic representation of the procedure used
for producing the aph-amy signal sequence and ermE-amy signal sequence
Streptomyces secretion vectors. In this figure, as in all of the
subsequent figures, a restriction site followed by (r) means that the
site was repaired by filling in the cut end using Klenow fragment and
dNTPs. If followed by (S1), the cut end was trimmed with S1
nuclease. If followed by (t), 3' single-stranded cut ends were
removed by the exonucleolytic action of the Klenow fragment. All
three procedures yield blunt ends.

Figure 6 shows the primary nucleotide sequence of the S.
antibioticus tyrosinase fragment. The upward arrow denotes the potential
signal sequence cleavage site in the ORF438 gene product. Underlined
nucleotide (nt) sequences denote: nt 138-148 = -35 region; nt 163-169
= -10 region; nt 258-263 and nt 730-733 = RBS. Asterisks (***)
specify terminator codons of the ORF438 and mel genes. ORF438 extends
from nt 270 to nt 707.

Figure 7 is a schematic representation of the procedure used
for producing the aph-ORF secretion signal Streptomyces secretion
vectors pSYC1289 and pSYC1290.

Figure 8 is a schematic representation of the procedure used 
for producing secreted tumor necrosis factor using aph-amy secretion 
signal and ermE-amy secretion signal Streptomyces secretion vectors. 

Figure 9 shows a 12.5% SDS-PAGE of TCA precipitated culture
supernatant from cultured Streptomyces lividans TK24 transformed with
pSYC1414. 1.1 ml supernatant of cells grown in R6 medium was
precipitated with a final concentration of 20% TCA. Pellets were washed
with acetone:ethanol (1:1) twice and resuspended in sample cracking
buffer. Lane 1, ermE promoter - amy signal peptide - TNF (pSYC1414).
Lane 2, negative control (pIJ702 with inactivated mel gene). Lane 3,
low molecular weight marker.

Figure 10 is a schematic representation of the procedure
used for producing E. coli alkaline phosphatase in Streptomyces using
the aph-ORF secretion vector.

Figure 11 is a 12.5% SDS-PAGE of concentrated culture
supernatant of pSYC1328, which comprises the aph promoter - ORF438
signal peptide - alkaline phosphatase fusion. Lane 1, low molecular
weight marker. Lanes 2, 3 and 4, pf130 membrane concentrated from 0.3
ml, 0.6 ml and 1.5 ml supernatant, respectively. Lane 5, 20% TCA
precipitated from 0.8 ml supernatant. Lane 6, 0.5 µg purified
bacterial alkaline phosphatase (SIGMA).

Figure 12 shows the synthetic Ttx gene. The single-stranded
synthetic fragments are identified with double lines, and are numbered
at the 5' ends. The expected amino acid sequence is shown in
single-letter abbreviations recommended by the IUPAC-IUB Commission on
Biochemical Nomenclature.

Figure 13 is an SDS-PAGE analysis of the extracellular
proteins from TK24 (pSYC1483). Lanes: (1) molecular weight standards,
as marked, in kilodaltons (kd); (2) and (3), extracellular proteins
precipitated from cultures of two independent isolates on day 5; (4)
and (5), samples from day 7; (6) sample from TK24. The arrow marks
the 16-kd protein band.

Figure 14 is a schematic representation of the procedure
used for producing the ermE-ORF Streptomyces secretion vectors used to
express and secrete lymphotoxin.

Figure 15 is a schematic representation of one process used
for producing the ermE1a-ORF Streptomyces secretion vector used to
express and secrete TNF.

Figure 16 is an SDS-PAGE gel showing an analysis of secreted
and cellular proteins produced by Streptomyces secretion vectors
secreting TNF.

Figure 17 is a Western blot of cellular and secreted
proteins produced by Streptomyces secretion vectors secreting TNF.

A. Definitions

In the detailed description of the invention, the following
terms are employed.

"Plasmid" as used herein means a covalently closed,
generally double-stranded circular DNA sequence which comprises an
intact replicon or replicating unit which is replicated in a host
cell.

"Promoter" as used herein means one of a number of DNA
sequences involved in the control of the initiation of transcription
of a gene. In general, a promoter is a DNA sequence which is involved
in the initiation of transcription of a DNA sequence encoding a
protein. Promoters frequently have a number of distinct structural
features and may, in addition, be regulated by operators, which control
the initiation of transcription, or attenuators, which are involved in
the sustaining of transcription. Most promoters include a
Shine/Dalgarno sequence; however, this sequence is not found in all
promoters, as is known with respect to, for example, the aph promoter.

"Control sequences" as used herein refers to DNA sequences
necessary for the expression of an operably linked coding sequence in
the Streptomyces host organism. The control sequences which are
suitable for Streptomyces include a promoter, a ribosome binding site
and, in general, the Shine/Dalgarno sequence, although it is known,
for example, that the aph promoter may operate and generally does
operate in the absence of any definable Shine/Dalgarno sequence.

"Streptomyces host" as used herein means any strain of
Streptomyces in which the recombinant vector according to the
invention is functional. pIJ702 and derivatives thereof have been
found to be functional in every strain of streptomycetes into which
they have been transformed. Among the Streptomyces hosts contemplated
within the definition are the following: S. lividans, S. coelicolor
A3(2), S. griseus, S. parvulus, S. albus G, S. vinaceus, S.
acrimycini, S. clavuligerus, S. limosus, S. rubiginosus, S. azureus,
S. glaucescens, S. rimosus, S. violaceoruber, S. kanamyceticus or any
streptomycetes host in which the replicon of pIJ702 is functional. By
"transformed" or "transformants" is meant host cells into which a
recombinant vector has been introduced and which maintain the vector
therein, usually in the presence of an agent, such as an antibiotic,
the resistance to which is conferred by the transforming plasmid.

"Operably linked" as used herein refers to the juxtaposition
of control sequences, such as a promoter and ribosome binding site, in a
manner that the normal function of the components is maintained.
Thus, a coding sequence "operably linked" to an encoded secretion
signal sequence refers to a configuration wherein the coding sequence
is joined to the signal sequence in such a manner that the signal
peptide is processed by the host cell and the processed protein
secreted. A secretion signal sequence operably linked to a promoter
is joined to the promoter in such a manner that the transcription and
translation of the secretion signal sequence is controlled by the
promoter, ribosome binding site, and Shine/Dalgarno sequence if
required.

"Heterologous" when used herein in connection with DNA
refers to DNA sequences that originate in an organism which is a
different species and usually a different genus than the species or
genus of the host cell into which the heterologous DNA is placed.
With respect to DNA encoding a protein or heterologous gene,
heterologous as used herein means DNA from an organism other than the
genus Streptomyces. "Homologous" as used herein with respect to DNA
encoding a protein means DNA from an organism of the genus
Streptomyces.

B. General Methods for Carrying out the Invention

Transformations

Depending on the host cell used, transformation is done
using standard techniques appropriate to such cells. The calcium
treatment employing calcium chloride, as described by Cohen, S. N.,
Proc. Natl. Acad. Sci. (USA) (1972) 69:2110, or the RbCl2 method
described in Maniatis, et al., Molecular Cloning: A Laboratory Manual
(1982) Cold Spring Harbor Press, p. 254 was used for procaryotes or
other cells which contain substantial cell wall barriers.

Growth of Streptomyces and molecular biological methods
particularly useful in connection therewith are described in Hopwood
et al., Genetic Manipulation of Streptomyces: A Laboratory Manual
(1985), The John Innes Foundation.

Vector Construction 

Construction of suitable vectors containing the desired
coding and control sequences employs standard ligation and restriction
techniques which are well understood in the art. Isolated plasmids,
DNA sequences, or synthesized oligonucleotides are cleaved, tailored,
and religated in the form desired.

Site specific DNA cleavage is performed by treating with the
suitable restriction enzyme (or enzymes) under conditions which are
generally understood in the art, and the particulars of which are
specified by the manufacturer of these commercially available
restriction enzymes. See, e.g., New England Biolabs, Product
Catalog. In general, about 1 µg of plasmid or DNA sequence is cleaved
by one unit of enzyme in about 20 µl of buffer solution. In the
examples herein, typically, a 3-10 fold excess of restriction enzyme
is used to ensure complete digestion of the DNA substrate. Incubation
times of about one hour to two hours at 37°C or other appropriate
temperatures are workable, although variations can be tolerated.
After each incubation, protein is removed by extraction with
phenol/chloroform, which may be followed by ether extraction, and the
nucleic acid is recovered from aqueous fractions by precipitation with
ethanol followed by running over a Sephadex G-50 spin column. If
desired, size separation of the cleaved fragments may be performed by
polyacrylamide gel or agarose gel electrophoresis using standard
techniques. A general description of size separations is found in
Methods in Enzymology (1980) 65:499-560 or Maniatis, Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold
Spring Harbor, NY (1982).
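
The digestion arithmetic above (one unit per µg as the baseline, a 3-10 fold excess in practice, and about 20 µl of buffer per µg) can be written out as a short Python sketch. The function name is hypothetical, and scaling the reaction volume linearly with the amount of DNA is an assumption made for illustration; the text gives only the single 1 µg / 20 µl reference point.

```python
def digest_plan(dna_ug, fold_excess=(3, 10), ul_per_ug=20.0):
    """Return (min_units, max_units, volume_ul) for a restriction digest.

    Baseline from the text: 1 unit of enzyme cleaves about 1 ug of DNA.
    The 3-10 fold excess is the range used in the examples. The linear
    volume scaling (ul_per_ug) is an assumption, not stated in the text.
    """
    if dna_ug <= 0:
        raise ValueError("dna_ug must be positive")
    low, high = fold_excess
    return (dna_ug * low, dna_ug * high, dna_ug * ul_per_ug)


# For 2 ug of plasmid: between 6 and 20 units of enzyme in 40 ul.
plan = digest_plan(2.0)
```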

Restriction cleaved fragments may be blunt ended by treating
with the large fragment of E. coli DNA polymerase I (Klenow) in the
presence of the four deoxynucleotide triphosphates (dNTPs) using
incubation times of about 15 to 25 minutes at 20 to 25°C in 50 mM Tris
pH 7.5, 50 mM NaCl, 6 mM MgCl2, 6 mM DTT and 5-10 µM dNTPs. The
Klenow fragment fills in at 5' overhangs but chews back protruding 3'
single strands, even though the four dNTPs are present. If desired,
selective repair can be performed by supplying only one of the, or
selected, dNTPs within the limitations dictated by the nature of the
sticky ends. After treatment with Klenow, the mixture is extracted
with phenol/chloroform and ethanol precipitated. Treatment under
appropriate conditions with S1 nuclease results in hydrolysis of any
single-stranded portion.
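
The two outcomes of Klenow treatment described above (5' overhangs filled in, 3' overhangs removed) can be modelled on a toy duplex. The sketch below is an illustrative Python model only: the function name and the column-aligned notation, in which "-" marks a missing base, are assumptions introduced here, and the example fragment is hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def _leading_gaps(strand):
    return len(strand) - len(strand.lstrip("-"))


def _trailing_gaps(strand):
    return len(strand) - len(strand.rstrip("-"))


def klenow_blunt(top, bottom):
    """Blunt both ends of a duplex the way the text describes for Klenow.

    `top` is written 5'->3' and `bottom` 3'->5', aligned column by column.
    A 5' overhang is filled in; a 3' overhang is removed.
    """
    if len(top) != len(bottom):
        raise ValueError("strands must be aligned to the same length")
    n = len(top)
    t, b = list(top), list(bottom)
    # Left end: the top strand ends 5', the bottom strand ends 3'.
    for i in range(_leading_gaps(bottom)):      # 5' overhang on top: fill in bottom
        b[i] = COMPLEMENT[t[i]]
    for i in range(_leading_gaps(top)):         # 3' overhang on bottom: remove it
        b[i] = "-"
    # Right end: the top strand ends 3', the bottom strand ends 5'.
    for i in range(n - _trailing_gaps(top), n):     # 5' overhang on bottom: fill in top
        t[i] = COMPLEMENT[b[i]]
    for i in range(n - _trailing_gaps(bottom), n):  # 3' overhang on top: remove it
        t[i] = "-"
    keep = [i for i in range(n) if t[i] != "-" and b[i] != "-"]
    return "".join(t[i] for i in keep), "".join(b[i] for i in keep)


# Hypothetical fragment with an EcoRI-type 5' overhang (AATT) on the left
# and a KpnI-type 3' overhang (GTAC) on the right.
blunt_top, blunt_bottom = klenow_blunt("AATTCGGCGGTAC", "----GCCGC----")
```

In this model the left end gains four base pairs by fill-in while the right end loses its four-base 3' extension, so both ends come out blunt.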

Synthetic oligonucleotides are prepared by the triester
method of Matteucci, et al. (J. Am. Chem. Soc. (1981) 103:3185) or
using commercially available automated oligonucleotide synthesizers.
Kinasing of single strands for labeling is achieved using an excess,
e.g., approximately 10 units of polynucleotide kinase to 10 pmole
substrate, in the presence of 50 mM Tris, pH 7.6, 10 mM MgCl2, 5 mM
dithiothreitol, 40 pmoles γ-32P-ATP (3000 Ci/mmole), 0.1 mM
spermidine, 0.1 mM EDTA.

Ligations are performed in 15-30 µl volumes under the
following standard conditions and temperatures: 20 mM Tris-Cl pH 7.5,
10 mM MgCl2, 10 mM DTT, 33 µg/ml BSA, 10 mM-50 mM NaCl, and either 40
µM ATP, 0.01-0.02 (Weiss) units T4 DNA ligase at 0°C (for "sticky end"
ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA ligase at 14°C
(for "blunt end" ligation). Intermolecular "sticky end" ligations are
usually performed at 33-100 µg/ml total DNA concentrations (5-100 nM
total end concentration). Intermolecular "blunt end" ligations
(usually employing a 10-30 fold molar excess of linkers) are performed
at 1 µM total ends concentration.
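
The relation between the mass concentrations and the end concentrations quoted above can be checked with a short calculation. The Python sketch below assumes an average mass of about 650 g/mol per base pair and two ends per linear fragment; both are common approximations rather than values given in the text, and the fragment lengths used are hypothetical.

```python
AVG_BP_MASS_G_PER_MOL = 650.0  # assumed average mass of one base pair


def end_concentration_nM(dna_ug_per_ml, fragment_bp):
    """Molar concentration of DNA ends (nM) for linear double-stranded DNA."""
    if dna_ug_per_ml <= 0 or fragment_bp <= 0:
        raise ValueError("concentration and length must be positive")
    grams_per_litre = dna_ug_per_ml * 1e-3  # 1 ug/ml equals 1 mg/l
    molar_fragments = grams_per_litre / (fragment_bp * AVG_BP_MASS_G_PER_MOL)
    return 2.0 * molar_fragments * 1e9      # two ends per linear fragment


# Hypothetical cases spanning the quoted range:
low_end = end_concentration_nM(33, 20000)   # long fragments, low DNA mass
high_end = end_concentration_nM(100, 3000)  # short fragments, high DNA mass
```

Under these assumptions, 33 µg/ml of 20-kb fragments gives roughly 5 nM ends and 100 µg/ml of 3-kb fragments gives roughly 100 nM ends, consistent with the 5-100 nM range stated for sticky end ligations.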

In vector construction employing "vector fragments", the
vector fragment is commonly treated with bacterial alkaline
phosphatase (BAP) in order to remove the 5' phosphate and prevent
religation of the vector. BAP digestions are conducted at pH 8 in
approximately 150 mM Tris, in the presence of Na+ and Mg2+ using about
1 unit of BAP per µg of vector at 60°C for about one hour. Vector
fragments subjected to this treatment are referred to herein as
"bapped". If unkinased oligodeoxyribonucleotides are used, however,
the vector fragments are not "bapped". In order to recover the nucleic
acid fragments, the preparation is extracted with phenol/chloroform
and ethanol precipitated and desalted by application to a Sephadex
G-50 spin column. Alternatively, religation can be prevented in vectors
which have been double digested by additional restriction enzyme
digestion of the unwanted fragments.

For portions of DNA which require sequence modifications,
site specific primer directed mutagenesis is used. This is conducted
using a synthetic oligonucleotide primer complementary to a single
stranded phage DNA to be mutagenized except for limited mismatching,
representing the desired mutation. Briefly, the synthetic
oligonucleotide is used as a primer to direct synthesis of a strand
complementary to the phage, and the resulting double-stranded DNA is
transformed into a phage-supporting host bacterium. Cultures of the
transformed bacteria are plated in top agar, permitting plaque
formation from single cells which harbor the phage.

Theoretically, 50% of the new plaques will contain the phage
having, as a single strand, the mutated form; 50% will have the
original sequence. The resulting plaques are hybridized with kinased
synthetic primer at a temperature which permits hybridization of an
exact match, but at which the mismatches with the original strand are
sufficient to prevent hybridization. Plaques which hybridize with the
probe are then picked, cultured, and the DNA recovered. Details of
site specific mutation procedures are described below in specific
examples.
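
The screening logic above rests on the primer matching the mutated strand exactly while mismatching the original strand. A minimal Python sketch of that comparison follows; the sequences and function names are hypothetical, and counting mismatches is only a stand-in for the temperature-dependent hybridization described in the text.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def mismatches(primer, phage_strand):
    """Count positions where the primer fails to pair with the phage strand."""
    if len(primer) != len(phage_strand):
        raise ValueError("primer and target window must have equal length")
    pairing_partner = reverse_complement(primer)
    return sum(a != b for a, b in zip(pairing_partner, phage_strand))


# Hypothetical 12-nt window of the single-stranded phage DNA.
original = "ACGTTGGCATGC"
mutant = "ACGTTGACATGC"              # desired single-base change (G to A)
primer = reverse_complement(mutant)  # mutagenic primer, also used as probe

hits_original = mismatches(primer, original)
hits_mutant = mismatches(primer, mutant)
```

The probe pairs perfectly with the mutant strand and carries one mismatch against the original, which is the difference the stringent hybridization temperature is chosen to exploit.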

Verification of Construction 

In the constructions set forth below, correct ligations for
plasmid construction are confirmed by first transforming E. coli
strain MM294 obtained from E. coli Genetic Stock Center, CGSC 6135, or
other suitable host with the ligation mixture. Successful
transformants are selected by ampicillin, tetracycline or other
antibiotic resistance or using other markers depending on the mode of
plasmid construction, as is understood in the art. Plasmids from the
transformants are then prepared according to the method of Clewell, D.
B., et al., Proc. Natl. Acad. Sci. (USA) (1969) 62:1159, optionally
following chloramphenicol amplification (Clewell, D. B., J. Bacteriol.
(1972) 110:667). The isolated DNA is analyzed by restriction enzyme
mapping and/or sequenced by the dideoxy method of Sanger, F., et al.,
Proc. Natl. Acad. Sci. (USA) (1977) 74:5463 as further described by
Messing, et al., Nucleic Acids Res. (1981) 9:309, or by the method of
Maxam, et al., Methods in Enzymology (1980) 65:499.

Hosts Exemplified 

Host strains used in cloning and expression herein are as
follows:

For cloning and sequencing, and for expression of
constructions under control of most bacterial promoters, E. coli strain
MM294 (supra), Talmadge, K., et al., Gene (1980) 12:235; Meselson,
M., et al., Nature (1968) 217:1110, was used as the host. For
expression under control of the N-RBS promoter, E. coli strain K12
MC1000 lambda lysogen, N7N53cI857SusP80, ATCC 39531 (hereinafter
sometimes referred to as MC1000-39531 XDG95 or DG95) may be used, as
well as E. coli strain DG116, also an MM294 strain (λ cI857, bio T76,
del H1); the bio T76 substitution deletes early λ function (N att)
and the del H1 deletion removes λ DNA from cro through att (del cro-
Bio + n~). This strain is deposited in the assignee's culture
collection under accession number CMCC 2298. For expression using pUC
vectors E. coli DG99 may be used. DG99 has been deposited in the ATCC
under Accession No. 39766.

For M13 phage recombinants, E. coli strains susceptible to
phage infection, such as E. coli K12 strain DG98, are employed. The
DG98 strain was deposited with ATCC on July 13, 1984 and has
accession number 1965.

C. Specific Description of the Invention 

The Streptomyces secretion vectors of the present invention
comprise a plasmid replicable in Streptomyces, comprising a high level
promoter operably linked to a DNA sequence encoding a Streptomyces
signal sequence, wherein the DNA sequence encoding the secretion
signal sequence may be operably linked to a heterologous DNA or gene
encoding a desired protein to produce the desired protein in operable
linkage with the Streptomyces signal sequence, and secrete the desired
protein. In a preferred embodiment, the DNA sequence encoding the
secretion signal sequence further includes a site for insertion of the
heterologous gene in operable linkage with the secretion signal
sequence. High level promoters useful in the secretion vector
according to the invention may be defined as promoters that are
functional in Streptomyces to produce, in general, a Streptomyces gene
product. High level promoters of this type are exemplified by
promoters involved in the production of Streptomyces antibiotics such
as aminoglycoside phosphotransferase (aph), neomycin
phosphotransferase (neo) and kanamycin phosphotransferase (kan). In
addition, the promoters of genes which produce resistance to
antibiotics are also known to be high level promoters. Such high
level promoters are exemplified by the erythromycin resistance gene
(ermE). Such high level promoters are suitable for use in conjunction
with a Streptomyces signal sequence and control the expression
thereof.

Also within the scope of the invention are modifications to
such high level promoters that may render them either more convenient
or more effective in controlling the initiation of transcription and
expression of the genes under their control. In the present
invention, the high level promoter may be modified. For example, the
ermE promoter is known to be a high level promoter. The ermE promoter
has been modified to produce a promoter designated ermE1a. The
sequence of the ermE promoter has been determined by Bibb et al., Gene
38:215-226 (1985), amended by the publisher in Gene 41:E357-E368 (1986). The
modification of the ermE promoter to produce ermE1a is carried out by
site-specific mutagenesis to remove the trinucleotide "TGG" in the
-35 region of the P1 promoter of ermE. As is shown in Figure 1,
ermE has a number of overlapping promoters. ermE1a has been shown by
Bibb et al. to yield greater levels of production of homologous
Streptomyces gene products. In the present invention the ermE1a
promoter has been further modified at the 3' end thereof to permit
ligation of the ermE1a promoter to suitable Streptomyces signal
peptides to create one embodiment of the Streptomyces secretion vector
according to the invention. The 3' end of the ermE1a promoter is
modified by first creating an SphI restriction site at the end of the
ermE1a sequence by site-specific mutagenesis, followed by the removal
of single-stranded DNA and digestion with EcoRI. The purified
modified ermE1a promoter may be conveniently ligated with a suitable
signal sequence as is shown in greater detail hereinbelow.

The aph promoter, as mentioned above, is also a high level
Streptomyces promoter. The aph promoter may also be modified so that
it may be conveniently ligated to a signal sequence to produce the
Streptomyces secretion vector according to the invention.
The promoter of the aph gene of S. fradiae has been
sequenced; see Bibb et al., Molec. Gen. Genet. 199:26-36 (1985). The
sequence of the aph promoter is shown in Figure 2 hereof. A shortened
fragment of the aph promoter was obtained by digestion of the aph
promoter with BssHII and NcoI endonucleases, followed by repair of the
single-stranded termini with Klenow fragment and dNTPs.

It will be appreciated from the foregoing that it may be
necessary to make certain modifications to the high level promoter
used to control the expression of the Streptomyces signal sequence in
the vectors according to the invention. Such modifications are
considered to be within the scope of the invention.

The Streptomyces signal peptide sequences appropriate for
use in the present invention are exemplified by the signal secretion
sequence of the ORF438 protein and the α-amylase signal peptide of S.
limosus. The invention however is not considered to be limited to
these two secretion signal sequences. The secretion signal sequences
of S. plicatus endoH, S. avidinii streptavidin and S. coelicolor
agarase are also believed to be suitable for use in vectors according
to the invention. The signal peptide sequences are also modified in
order to provide sites for insertion of heterologous genes in operable
linkage with the secretion signal sequence. In general, the DNA
encoding the 3' end of the signal sequence and the sequence encoding
the NH2 terminal of the mature homologous protein that is normally
produced under the control of the secretion signal sequence are
site-specifically modified to provide a number of restriction sites
for the insertion of the DNA encoding the desired heterologous gene.
The insertion sites are produced as restriction sites which allow the
ligation of the DNA sequence encoding the desired heterologous gene to
produce a fusion of the protein to the signal sequence encoded thereby,
preferably at the -1 or +1 residues, wherein the -1 residue is the last
residue of the secretion signal sequence and the +1 residue is the
first residue of the mature desired heterologous protein. The
foregoing modifications to the promoter or the secretion signal
sequence may be carried out, of course, by a variety of means.
Oligonucleotide primer directed site-specific mutagenesis may be used
to modify the promoter or the signal secretion sequence as is shown in
detail hereinbelow. However, other means may be used to produce the
desired DNA sequence for the promoter or the signal secretion
sequence. Indeed, it is possible to produce modified promoters or
secretion signal sequences by synthesizing oligonucleotides encoding
the required sequence and preferred restriction sites through various
automated DNA synthesizers. A variety of DNA sequences encoding
heterologous proteins may be expressed using the prominent secretion
signal sequences of the present invention. Exemplified hereinbelow
are but two of the human proteins, lymphotoxin and tumor necrosis
factor, that may be expressed and secreted using the vectors according
to the invention. In addition, the procaryotic protein alkaline
phosphatase obtained from E. coli is also produced in Streptomyces and
secreted therefrom using the secretion vectors according to the
invention.

The following examples are intended by the inventor to be 
exemplary and non-limiting. 

Example 1

I. Determination of the nucleotide sequence of the promoter proximal
region of the α-amylase gene of S. limosus and the signal peptide
sequence of the pre-α-amylase.

A. Materials 

Plasmid pIJ702 is a 5.8-kb, high copy number Streptomyces
cloning vector (I). It is derived from S. lividans strain 66, and
carries the tsr (thiostrepton resistance) and the mel (tyrosinase)
genes. See Katz et al., J. Gen. Microbiol. (supra). pIJ702 can be
obtained by reversing the orientation of the tsr, mel BclI fragment of
pIJ702, ATCC #39378. In addition pIJ702 is available as ATCC
#37287. See p. 251, ATCC Catalogue, 16th ed. (1985).

Plasmid pIJ702.No5SL is the original Streptomyces limosus
amylase clone as a 6-kb insert in pIJ702. The pIJ2921.12SL plasmid is
a pUC18 derivative of pIJ702.No5SL containing a 2.25-kb subcloned amy
fragment flanked by KpnI sites. pUC18 is described in Gene 26:101-106
(1983) and is the same as pUC19 except that the polylinker sequence is
in the opposite orientation. pUC19 is commercially available. See
New England Biolabs Catalogue. The host strain used is S. lividans
TK24 (pro-2, str-6, SLP2-, SLP3-); see Hopwood et al., J. Gen.
Microbiol. 129:2257-2269. Other vectors are those frequently used in
E. coli.

B. Mapping of the insert in pIJ2921.12SL 

The restriction map of the insert of pIJ2921.12SL is shown
in Figure 3. The location of the transcription start site maps just
inside the insert in pIJ2921.12SL. The 5' overlapping fragment
(2.3-kb SstI-EcoRI) from pIJ702.No5SL that contained the amy promoter
sequence was also cloned and used for mapping and sequence analysis.

C. Nucleotide sequence 

Figure 4 shows the nucleotide sequence of the entire amy
gene and the deduced amino acid sequence. Fragments containing
portions of the amy gene were subcloned and the sequence determined by
the dideoxynucleotide chain termination method. (See Messing et al.,
Nuc. Acid. Res. 9:309 (1981).) Additional sequencing primers were
made according to preliminary sequence information, and they were used
to generate further sequence data. The sequence shown here has been
determined from both strands.

D. Reconstruction and cloning of the amy gene into pIJ702 

To clone the amy promoter and the coding sequence of the amy
gene with convenient flanking restriction sites, the following
digestions and ligations were performed:

1. The 74-bp ApaI-KpnI fragment of pIJ702.No5SL was
inserted into the HincII-KpnI site of pUC19 plasmid. This introduced
the SphI site 5' to the original ApaI site located next to the amy
promoter. The resulting intermediate plasmid was designated pSYC1344.

2. The 2.06-kb KpnI-BssHII fragment of pIJ2921.12SL (the
BssHII end was first made flush by E. coli DNA polymerase Klenow
fragment repair reaction) was isolated. It was cloned into the
KpnI-HincII region in pUC19. This introduced the PstI site at the 3'
end of the amy sequence. The resulting intermediate plasmid was
designated as pSYC1346.

3. The two amy segments in these two plasmids were excised
by SphI-KpnI and KpnI-PstI, respectively. The SphI-KpnI and KpnI-PstI
fragments were ligated to the large SphI-PstI fragment of pIJ702,
resulting in plasmid pSYC1388 (see Figure 3C). Plasmid pSYC1388 has
been deposited in the assignee's culture collection as CMCC No. 3027,
and has been deposited in the American Type Culture Collection on
March 4, 1987.

E. N-terminal sequence of mature α-amylase

Plasmid pSYC1388 was introduced into S. lividans strain TK24
by transformation. Transformants were picked by the loss of the dark
colored phenotype conferred by the mel gene. One transformant was
picked and grown in R6 medium at 30°C for 3 days. Cells were removed
from the culture by centrifugation followed by filtration through a
Millipore membrane. The exoproteins in the filtrate were concentrated
and dialyzed by diafiltration using a PM30 membrane. The sample
containing the Streptomyces exoproteins was further fractionated on a
Sephacryl S-200 column using sodium acetate buffer, pH 5.5, 2 mM DTT
and 0.1% SDS. The purified α-amylase protein was subjected to
amino-terminal sequencing using an ABI gas-phase sequencer, which
revealed the following sequence:

Ala-Pro-Pro-Gly-Ala-Lys-Asn

This sequence is in total agreement with the deduced sequence shown in
Figure 4 and establishes the processing site by the signal peptidase
as shown in that figure.

The following features of the amy promoter result from the
sequencing data.

The transcription start site is the G located at position
243. This suggested that the "-35" and the "-10" sequences of the amy
promoter are TTGACC and TACGGT, respectively.

The likely translational start is at position 311. The ATG
triplet is preceded by the putative S/D sequence GGAGG located
upstream.

The signal peptide starts from Met, and ends at the 28th
residue (the Ala at

An open reading frame (ORF) is located at the 5' end of this
fragment. It is followed by an inverted repeat resembling the
rho-independent transcription terminator sequence.

II. The construction and use of Streptomyces secretion vectors 

A. Introduction and Materials 

Vectors with the coding sequences of Streptomyces signal
peptides under the control of high level promoters were constructed
for the production of secreted heterologous proteins as follows:

The amylase gene from S. limosus (amy[Sli]) in pIJ2921.12SL
and pIJ702 was described in the previous example. The ORF438 gene
is located upstream of the tyrosinase gene (mel, for melanin) on
pIJ702. See Bernan et al., Gene 37:101-110 (1985). Plasmid pIJ61 is
a 14.8-kb low copy number Streptomyces cloning vector. It is derived
from the SLP1.2 plasmid from S. lividans 66, and carries the tsr
(thiostrepton-resistance) and aph (aminoglycoside phosphotransferase)
genes. See D. A. Hopwood, M. J. Bibb, K. F. Chater, T. Kieser, C. J.
Bruton, H. M. Kieser, D. J. Lydiate, C. P. Smith, J. M. Ward, and H.
Schrempf, 1985, Genetic Manipulation of Streptomyces, A Laboratory
Manual (The John Innes Foundation).

pIJ449 is a 3.2-kb plasmid containing the promoter region of
the ermE (erythromycin-resistance) gene in the polylinker of a pUC18
derivative. See M. J. Bibb, G. R. Janssen, and J. M. Ward, 1985, Gene
38:215-226 (amended by publisher in 1986, Gene 41:E357-E368).

B. Cloning and modification of the signal peptide coding 
region of amy (Sli) to produce secretion sites for 
insertion of heterologous DNA 

The promoter-proximal region of the amy (Sli) gene was
sequenced as described above. The 129-bp EcoRI-SmaI fragment from
pIJ2921.12SL was ligated into the EcoRI-SmaI sites of RF DNA of
M13mp10. This fragment contains the coding sequence for the amylase
signal peptide. The Ala residue at position 29 (counting from the
putative initiation Met) is the last residue in the signal peptide
prior to the proamylase protein sequence and Ala-28 is the "-1"
residue as shown in Figure 1. The oligodeoxyribonucleotide primer
SC64 was designed, chemically synthesized and used to modify the
sequence by site-directed mutagenesis. The primer has the sequence
(5'-CCCGGGCGGCGCCGCAGCGGCGGG-3'). The resulting RF DNA (designated
pSYC1301) of the M13 phage has a new sequence for NarI (which is also
BanI and HaeII) at the end of the coding sequence for amino acid #29,
but still encodes the same peptide sequence.

original sequence:  ....CCC,GCC,GCT,GCC,GCC,CCG,....
pSYC1301:           ....CCC,GCC,GCT,GCG,GCG,CCG,....
                       -Pro-Ala-Ala-Ala-Ala-Pro-
                             26  27  28  29       (position in precursor)
                             -3  -2  -1  +1  +2   (position relative to
                                                   cleavage site)

Digestion with BanI, and NarI or HaeII of this modified
sequence permits fusions at the -1 or +1 residues, respectively. This
allows convenient manipulation to create fusions to the amy signal
sequence.

pSYC1301 has been deposited in the applicants' culture
collection as CMCC number 3035 and was deposited with the American
Type Culture Collection on March 4, 1987. The production of pSYC1301
is shown in Figure 5.
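The sequence change described above can be checked with a short script. The following is a minimal sketch in Python, assuming only the sequences printed in the text; the helper names (`translate`, `reverse_complement`, `sites_present`) are illustrative and not part of the original disclosure. It confirms that the two nucleotide changes are silent, that they create a site recognized by NarI, BanI and HaeII, and that primer SC64 is complementary to the modified coding strand.

```python
import re

# Codons occurring in this short region (standard genetic code).
CODON_TABLE = {"CCC": "Pro", "CCG": "Pro",
               "GCC": "Ala", "GCT": "Ala", "GCG": "Ala"}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Recognition sequences as regular expressions (Y = C/T, R = A/G).
SITES = {"NarI": "GGCGCC", "BanI": "GG[CT][AG]CC", "HaeII": "[AG]GCGC[CT]"}

ORIGINAL = "CCCGCCGCTGCCGCCCCG"     # ....CCC,GCC,GCT,GCC,GCC,CCG,....
MODIFIED = "CCCGCCGCTGCGGCGCCG"     # pSYC1301
SC64 = "CCCGGGCGGCGCCGCAGCGGCGGG"   # mutagenic primer, 5'->3'


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def translate(seq):
    """Translate an in-frame sequence into three-letter residue names."""
    return [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]


def sites_present(seq):
    """Names of the listed enzymes whose recognition sequence occurs in seq."""
    return sorted(name for name, pat in SITES.items() if re.search(pat, seq))


if __name__ == "__main__":
    print("original :", "-".join(translate(ORIGINAL)), sites_present(ORIGINAL))
    print("pSYC1301 :", "-".join(translate(MODIFIED)), sites_present(MODIFIED))
    print("SC64 anneals to modified strand:",
          reverse_complement(SC64).startswith(MODIFIED))
```

Because SC64 is written as the complement of the coding strand, the comparison is made against its reverse complement.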

C. The cloning and modification of the signal peptide coding
sequence of the ORF438 gene to produce restriction
sites for the insertion of heterologous DNA

The 438-codon open reading frame (ORF438) that is linked to
the mel gene has been described by Bernan et al., Gene, supra. As
indicated by the authors, the ORF438 gene could potentially encode a
secreted protein because its amino-terminal sequence resembles a
signal peptide sequence. The signal peptidase cleavage site of the
putative secretory protein was speculated to be between Ala-30 and
Asp-31 as shown in Figure 6.
The PvuI site (CGATCG) of the ORF438 putative signal
sequence was used for fusion of heterologous coding sequences at the
+2 residue. In addition, the oligodeoxyribonucleotide primer SC69 was
used to introduce a NarI site at the putative signal peptidase
cleavage site by site-specific mutagenesis using an M13 vector. The
primer had the sequence: 5'-CGCCGCCCGGGCGCCCGATCGGGG-3'.

One procedure used was as follows and is shown schematically
in Figure 7.

The 433-bp BclI-BglII fragment of pIJ702 that contains the
entire putative signal peptide coding sequence of ORF438 was cloned
into the BamHI site of pUC13. Insertions in both orientations were
obtained; the 433-bp BclI-BglII fragment included the HincII site just
upstream from the putative Shine/Dalgarno sequence of ORF438. Two
plasmids were obtained as follows:

pSYC1259 (also pSYC1260):

EcoRI-SstI...(Bam/Bgl)-HincII-ORF438-PvuI-(Bcl/Bam)...XbaI-SalI-PstI-
HindIII;

pSYC1261 (also pSYC1262):

HindIII-PstI-SalI-XbaI...(Bam/Bgl)-HincII-ORF438-PvuI-
(Bcl/Bam)...SstI-EcoRI.

The EcoRI-HindIII fragment from pSYC1261 was cloned into
M13mp11 at the corresponding sites, and SC69 primer was used to
generate a NarI site by primer directed site specific mutagenesis.
The following changes resulted:

                   -3  -2  -1  +1  +2  +3
          -Pro-Ala-Ala-Arg-Ala-Asp-Asp-Arg-Gly-
ORF438:   ...,CCC,GCC,GCC,CGC,GCG,GAC,GAT,CGG,GGG,
                            *     **
pSYC1392: ...,CCC,GCC,GCC,CGG,GCG,CCC,GAT,CGG,GGG,
                                 -Pro-

This mutation created a NarI site and changed the Asp at the +1
position to a Pro. The RF DNA of the M13 phage was designated pSYC1392.
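The codon-level effect of the SC69 mutagenesis can be tabulated programmatically. The sketch below, in Python, uses only the two sequences printed above; the function names and the position labels for residues outside the -3 to +3 window are assumptions made for illustration. It lists each changed codon with its position relative to the putative cleavage site and checks that the NarI site is new while the PvuI site is retained.

```python
# Codons occurring in this region (standard genetic code).
CODON_TABLE = {"CCC": "Pro", "GCC": "Ala", "GCG": "Ala", "CGC": "Arg",
               "CGG": "Arg", "GAC": "Asp", "GAT": "Asp", "GGG": "Gly"}

ORF438 = "CCCGCCGCCCGCGCGGACGATCGGGGG"    # wild-type region
PSYC1392 = "CCCGCCGCCCGGGCGCCCGATCGGGGG"  # after SC69 mutagenesis
SC69 = "CGCCGCCCGGGCGCCCGATCGGGG"         # mutagenic primer, 5'->3'

# Residue positions relative to the putative cleavage site (no position 0).
POSITIONS = [-5, -4, -3, -2, -1, 1, 2, 3, 4]

NARI = "GGCGCC"
PVUI = "CGATCG"


def codons(seq):
    """Split an in-frame sequence into codons."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def codon_changes(before, after):
    """Return (position, old codon, new codon, old residue, new residue)
    for every codon that differs between the two sequences."""
    changes = []
    for pos, old, new in zip(POSITIONS, codons(before), codons(after)):
        if old != new:
            changes.append((pos, old, new, CODON_TABLE[old], CODON_TABLE[new]))
    return changes


if __name__ == "__main__":
    for change in codon_changes(ORF438, PSYC1392):
        print(change)
    print("NarI created:", NARI in PSYC1392 and NARI not in ORF438)
    print("PvuI retained:", PVUI in ORF438 and PVUI in PSYC1392)
    print("SC69 matches mutant coding strand:", SC69 in PSYC1392)
```

Unlike SC64, primer SC69 as printed reads in the same sense as the coding strand, so it is compared directly rather than as a reverse complement.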
D. Cloning and modification of promoters for linkage to
DNA encoding signal sequences for the expression of
fusion genes

The promoters for the aminoglycoside phosphotransferase
(aph) gene from S. fradiae (Bibb et al., Mol. Gen. Genet., supra) and
for the erythromycin-resistance (ermE) gene from S. erythreus (Bibb et
al., Gene, supra) have been sequenced, and the transcriptional
initiation sites determined as is shown in Figures 1 and 2.

1. aph-amy fusion: The aph promoter sequence flanked by
the BssHII and the NcoI sites was excised from plasmid pIJ61. The
termini of the fragment were repaired with Klenow fragment of DNA
polymerase of E. coli, and blunt-end ligated into the SmaI site of
plasmid pUC13. The resulting plasmid pSYC1152 contained the aph
promoter sequence in the orientation in which the EcoRI site in the
pUC13 polylinker is located 5' to the aph promoter. The NcoI site
originally derived from the aph gene was regenerated. The same
fragment inserted in the opposite orientation was also obtained, and
it was designated as pSYC1151. The production of pSYC1151 and
pSYC1152 is shown schematically in Figure 7. Plasmids pSYC1151 and
pSYC1152 were deposited in applicants' culture collection as CMCC #3010
and 3011, respectively, and were deposited in the American Type
Culture Collection on March 4, 1987. The polylinker regions contain
the sequences as shown below:

pSYC1152: EcoRI-SstI...aph promoter...NcoI-BamHI-XbaI-SalI-PstI-
HindIII

pSYC1151: HindIII-PstI-SalI-XbaI-BamHI-...aph promoter...NcoI-SstI-
EcoRI

The pSYC1152 DNA was cleaved with NcoI, repaired with Klenow
fragment and then cleaved by HindIII. The large HindIII-blunt end
fragment was purified. The pSYC1301 DNA was cleaved with EcoRI, the
termini were made flush with S1 nuclease, and then digested with
HindIII. The small resulting fragment that contained the modified amy
(Sli) sequence derived from pSYC1301 was isolated and was then ligated
to the pSYC1152-derived fragment described above. The resulting
plasmid pSYC1309 contains the aph promoter and the coding sequence of
amylase signal peptide in the following order:

pSYC1309:

EcoRI-SstI...aph promoter...amy (Sli) signal peptide...NarI...
HindIII.

Plasmid pSYC1309 was deposited in applicants' culture collection
as CMCC #3012 and was deposited in the ATCC on March 4, 1987. The
plasmid pSYC1309 was confirmed by restriction endonuclease
digestion. The production of pSYC1309 is shown schematically in
Figure 5.

2. ermE promoter-amy fusion: In order to create this
fusion, an SphI site was first created 3' to the ermE promoter as
follows. Plasmid pIJ449 was digested with EcoRI and HindIII. The
0.59-kb fragment that contained the ermE promoter fragment was cloned
into the corresponding site in M13mp10. The 23-nucleotide
oligodeoxyribonucleotide primer SC65 having the sequence
5'-CCGAACTGCGCATGCGCTGGATC-3' was synthesized on an automatic DNA
synthesizer and was employed for primer directed site specific
mutagenesis which generated an M13 derivative with RF DNA that was
designated as pSYC1303. The following changes in the ermE promoter
were made:

original ermE:  (-10)        fMet Ser Ser Ser Asp
                TAGGATCCAGCG GTG AGC AGT TCG GAC

pSYC1303:       TAGGATCCAGCGCATG CGC AGT TCG GAC
                           SphI

pSYC1303 was deposited in applicants' culture collection as CMCC #3036
and was deposited in the ATCC on March 4, 1987.

The pSYC1303 DNA was digested with SphI, and the single-stranded
3' tetranucleotides were removed by the exonuclease action of DNA
polymerase. This DNA was then digested with EcoRI and the ermE
promoter fragment was purified. This ermE-derived fragment, and the
modified amy fragment prepared and isolated from pSYC1301 by the
procedure described in the preceding paragraph, were ligated to pUC18
DNA that had been digested with EcoRI and HindIII. The resulting
plasmid, pSYC1311, had ermE and amy in the following order:

pSYC1311:

EcoRI-SstI...ermE promoter...amy signal peptide coding
sequence...NarI-HindIII
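The SphI engineering of the ermE promoter described above can be followed step by step in code. This is a minimal sketch in Python under stated assumptions: the sequences are those printed for the original ermE region, pSYC1303 and primer SC65; SphI is taken to cut GCATG^C, leaving a four-base 3' overhang; and the function names are hypothetical. It checks that SC65 is complementary to the mutant strand, that the SphI site is new, and that trimming the overhang leaves a blunt promoter fragment ending immediately before the original GTG start codon.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

ORIGINAL = "TAGGATCCAGCG" + "GTGAGCAGTTCGGAC"   # -10 region, then fMet Ser Ser Ser Asp
PSYC1303 = "TAGGATCCAGCGCATG" + "CGCAGTTCGGAC"  # after SC65 mutagenesis
SC65 = "CCGAACTGCGCATGCGCTGGATC"                # mutagenic primer, 5'->3'
SPHI = "GCATGC"                                 # assumed cut: GCATG^C


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def blunt_upstream_fragment(seq, site=SPHI):
    """Upstream duplex left after SphI digestion and removal of the
    single-stranded 3' CATG overhang.

    The top strand is cut after GCATG; the bottom strand is cut one base
    into the site, so only the first G of the site stays double-stranded.
    """
    start = seq.find(site)
    if start < 0:
        raise ValueError("no SphI site found")
    return seq[:start + 1]


if __name__ == "__main__":
    print("SC65 anneals to mutant:", reverse_complement(SC65) in PSYC1303)
    print("SphI site created:", SPHI in PSYC1303 and SPHI not in ORIGINAL)
    print("blunt promoter end:", blunt_upstream_fragment(PSYC1303))
```

The trimmed fragment is identical to the first twelve bases of the original sequence, which is consistent with the promoter being joined to the amy fragment without carrying over the ermE start codon.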

The production of pSYC1311 is shown schematically in Figure
5. Plasmid pSYC1311 has been deposited in applicants' culture
collection as CMCC #3013 and was deposited in the ATCC on March 4,
1987.

3. aph-ORF438 fusion:

Plasmid pSYC1151 carrying the modified aph promoter was
digested with NcoI, the termini repaired with Klenow fragment, and
then digested with HindIII. The resulting 0.2-kb fragment containing
the aph promoter was cloned into plasmid pSYC1260 between the HindIII
and HincII sites, and generated pSYC1289 and pSYC1290. The production
of pSYC1259, pSYC1260, pSYC1261 and pSYC1262 from plasmid pUC13 and
pIJ702 is shown in Figure 7. Similarly, the aph promoter fragment was
excised from pSYC1152 by digestion with NcoI, the ends filled in, and
digested again with XbaI. This fragment was cloned into pSYC1262
between the XbaI and HincII sites, resulting in pSYC1291 and pSYC1292.

pSYC1289/1290:

HindIII-PstI-SalI-XbaI-BamHI...aph promoter...ORF438 signal
peptide...SstI-EcoRI

pSYC1291/1292:

EcoRI-SstI...aph promoter...ORF438 signal peptide...XbaI-SalI-PstI-
HindIII

The production of pSYC1289/1290 and pSYC1291/1292 is also
shown in Figure 7. Plasmids pSYC1259, 1261, 1290 and 1292 were
deposited in applicants' culture collection as CMCC #'s 3016, 3017,
3018 and 3019, respectively. Plasmids pSYC1261 and pSYC1290 were
deposited with the ATCC on March 4, 1987.

E. Expression of heterologous genes

1. TNF

a. Construction of aph-amy-TNF fusion

This fusion gene was constructed by ligation of four
fragments and is schematically shown in Figure 8: (1) the large pUC13
fragment generated from EcoRI and SstI double digestion; (2) the
SstI-NarI fragment from pSYC1309, which contains the modified aph
promoter; (3) the amino terminal coding portion of the TNF gene from
M13 RF DNA LL02-4, generated from partial digestion with HapII and
complete digestion with BstEII (this 385-bp fragment has a 5'-HapII
site that was created by site-directed mutagenesis); and (4) the
490-bp BstEII-EcoRI fragment from pAW711D that contains the C-terminal
coding region of TNF and the cry terminator.

The resulting intermediate plasmid pSYC1376, which was
confirmed by fragment sizing of endonuclease digested plasmid, contains
the following sequences:

HindIII-(linkers)-SstI..aph promoter..amy signal
peptide..TNF..BamHI..cry..EcoRI

b. Construction of an ermE-amy-TNF fusion

Similarly, the 0.4-kb SstI-NarI fragment from pSYC1311 that
contains the ermE promoter was used, and the intermediate plasmid
pSYC1377 was constructed by the four-fragment ligation method as
described in the previous section for the construction of pSYC1376 and
as schematically shown in Figure 8. The aph promoter fragment was
substituted by the ermE fragment in pSYC1377.

The junction sequence between amylase signal peptide and the
TNF sequence in both pSYC1376 and pSYC1377 is shown below:

            amy-derived         TNF-derived
....amy CCC,GCC,GCT,GCG,GCG,GTC, TNF...
       -Pro-Ala-Ala-Ala-Ala-Val-

Plasmids pSYC1376 and pSYC1377 were deposited in applicants'
culture collection as CMCC #'s 3014 and 3015, respectively.

c. Construction of a replicable vector for expression
and secretion of TNF in Streptomyces lividans:

The construction of the expression vectors is shown
schematically in Figure 8. The fragments containing the aph-amy-TNF
and the ermE-amy-TNF fusions were purified from plasmids pSYC1376 and
pSYC1377, respectively. These plasmids were first digested with
EcoRI, the ends repaired, and then digested with SstI. The
repaired-EcoRI-to-SstI fragments were purified. Plasmid pIJ702 was
digested with PstI, the ends were made flush by the exonuclease
activity of DNA polymerase, and then digested with SstI. The large
pIJ702-derived fragment was then ligated separately to each of the two
promoter-amy-TNF fragments. The resulting Streptomyces plasmids
pSYC1412 and pSYC1414 carry the aph-amy-TNF and the ermE-amy-TNF
fusions, respectively.

S. lividans TK24 carrying pSYC1412 and pSYC1414 were grown
in R6 medium for 3 days. The supernatants from these cultures were
assayed for TNF biological activity using mouse cell line L929, as
described in U.S. Patent No. 4,667,064 issued June 30, 1987 and
assigned to the assignee of the present application. 1.3 x 10* U/ml
of TNF was detected in the culture containing pSYC1412 (aph-amy-TNF)
and 3.9 x 10^4 U/ml in the culture containing pSYC1414 (ermE-amy-TNF).
Cell density of these cultures was not determined.

The secreted TNF protein produced by the pSYC1414-
transformed S. lividans strain was also analyzed. The proteins from
1.1 ml of culture supernatant from the control culture (pIJ702 plasmid
with inactivated mel gene) and from the test culture (pSYC1414
plasmid) were precipitated by 20% TCA. Samples were then fractionated
on a 12.5% SDS-PAGE by electrophoresis. The result is shown in Figure
9. The secreted TNF is estimated to be in the range of 1 mg/l, which
is consistent with the bioassay result. This material was purified
and its amino-terminal sequence determined. The junction sequence
between the amylase signal peptide and TNF is:

 -4  -3  -2  -1                            (position in amylase)
-Pro-Ala-Ala-Ala-Ala-Val-Arg-Ser-Ser-Ser-
                     "+1"             ^
                     in TNF           (observed start of secreted TNF)

The observed start is the Ser as marked above. Four
residues of the mature TNF were lost, presumably due to peptidase
activities in the culture supernatant. As can be seen in Figure 9,
there is a minor band above the major TNF protein band that may
represent the primary secreted TNF protein, or an intermediate
degradation product. However, TNF lacking the first four amino acid
residues is biologically active.

Plasmids pSYC1412 and pSYC1414 have been deposited in
applicants' depository as CMCC #'s 3028 and 3029, respectively, and
were deposited in the ATCC on March 4, 1987.

2. Secretion of alkaline phosphatase (AP) of E. coli in
Streptomyces

The production of the secretion vector for E. coli alkaline
phosphatase in Streptomyces is shown schematically in Figure 10. The
E. coli plasmid pSYC1204 contains the phoA gene with the cry gene
terminator at the 3'-end of the phoA gene. A plasmid having the phoA
gene and cry terminator may be constructed in accordance with the
disclosure of European Patent Application No. 86302201.8 filed March
25, 1986. An NruI sequence may also be introduced into the phoA
leader. Alternatively, the DNA sequence of the modified phoA leader
and cry terminator may be synthesized as oligodeoxyribonucleotides.
pSYC1204 has the following organization:

...HindIII....phoA promoter..phoA signal peptide..NruI..AP..BamHI..
cry terminator..EcoRI...

An NruI site was introduced at the coding sequence for the
"-1" residue of the pre-AP protein. The plasmid pSYC1204 was digested
with HindIII and NruI, and the large fragment, containing the coding
sequence for mature AP, purified. Plasmid pSYC1259 was digested with
PvuI (which is located at the sequence corresponding to the second
codon of the putative mature ORF438 protein), the single-stranded
dinucleotide was trimmed away with the exonuclease activity of DNA
polymerase, and then digested further with HindIII. The small
fragment carrying the ORF438 promoter and putative signal peptide
coding sequence was ligated to the pSYC1204-derived large fragment
under blunt end conditions. This resulted in the plasmid pSYC1267.
The ORF-phoA fusion sequence was excised from pSYC1267 by BamHI and
PstI digestion, and was cloned into the PstI-BglII sites in pIJ702,
which generated the Streptomyces plasmid pSYC1283. S. lividans
carrying pSYC1283 was grown in R6 medium for 3 days and produced a low
level of AP in the culture supernatant as determined by a conventional
spectrophotometric assay using p-nitrophenyl phosphate (disodium) as
described in J. Bacteriol., 152:692 (1982). The AP protein could not
be identified from the total proteins in 1 ml culture by Coomassie blue
staining of the gel-banded proteins.

A plasmid was also constructed that expressed phoA under the
control of the aph promoter. Plasmid pSYC1283 was partially digested
with SphI (one of the SphI sites on the plasmid is located in the
ORF438 sequence corresponding to the translation initiation site), and
then digested with PstI which is located upstream of the phoA
sequence. The fragment that had the ORF438 promoter and
Shine/Dalgarno sequence deleted was purified and ligated to the
PstI-SphI fragment of pSYC1290 that carried the aph promoter and the
ORF438 Shine/Dalgarno sequence. This generated the Streptomyces
plasmid pSYC1328.

The supernatant of a 3-day culture of S. lividans carrying
pSYC1328 grown in R5 medium was concentrated by diafiltration using a
PM30 membrane. The exoproteins were analyzed by SDS-PAGE as shown in
Figure 11. The secreted AP represents a major species of exoprotein
synthesized. Surprisingly, the level is estimated to be around 5
mg/l. This concentrated exoprotein preparation was then fractionated
in the presence of 0.1% SDS through a S-200 Sephacryl column. The
secreted AP was then subjected to amino terminal sequence analysis.
The ORF438-phoA fusion has the following junction sequence:

position in ORF438:  -4  -3  -2  -1  +1  +2
position in AP:                              +1  +2  +3  +4
                    -Ala-Ala-Arg-Ala-Asp-Ala-Arg-Thr-Pro-Glu-

The sequencing data showed that about 1/3 of the purified AP
has the Ala-Asp-Ala-Arg-Thr-Pro-.... sequence, and about 2/3 started
with the authentic mature AP sequence Arg-Thr-Pro-Glu. It is unclear
whether the ORF-AP "pre" protein was processed at two different sites
that yielded these two species of proteins, or whether there was only a
single signal peptidase processing site, but some of the protein was
further degraded. In fact, heterogeneity at the amino terminus of
secreted proteins is not uncommon in many systems; one example in
Streptomyces is the endo-H protein from S. plicatus. See Robbins et
al., J. of Biol. Chem. 259:7572-7581 (1984). The result is also
surprising in that it suggests that the aph promoter functions with
great efficiency in the presence of a downstream Shine/Dalgarno
sequence.

Example 2 

Synthesis and Molecular Cloning of the Human Lymphotoxin Gene and its
Expression in Streptomyces lividans

In this example, a synthetic ltx gene was assembled in
E. coli and expressed in Streptomyces lividans. Secreted lymphotoxin
was produced as a major protein in the medium at the level of several
micrograms per milliliter in shake-flask culture.

To assemble the synthetic ltx gene, oligodeoxyribonucleotide
fragments shown in Figure 12 by the double lines were synthesized
using an automated DNA synthesizer and were individually kinased.
Synthetic fragments, each in 12 µl volume at 17 pmole/µl, were
combined in pairs. To this, an equal volume (24 µl) of 2 x annealing
buffer (200 mM NaCl, 40 mM MgCl2, 40 mM Tris-HCl, pH 7.9, and 40 mM
β-mercaptoethanol) was added. This mixture was then heated to 70°C and
gradually cooled to 10°C over several hours.
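The final composition of each annealing mix follows directly from the volumes and concentrations given above. The short Python sketch below works through that arithmetic; the function name and dictionary keys are illustrative only, and it assumes simple additive volumes.

```python
def annealing_mix(frag_vol_ul=12.0, frag_conc_pmol_per_ul=17.0,
                  frags_per_pair=2, buffer_2x_mM=None):
    """Work out the composition of one annealing mix.

    Two kinased fragments are pooled and an equal volume of 2 x annealing
    buffer is added, so every buffer component ends up at half strength.
    """
    if buffer_2x_mM is None:
        buffer_2x_mM = {"NaCl": 200.0, "MgCl2": 40.0,
                        "Tris-HCl": 40.0, "beta-mercaptoethanol": 40.0}
    pair_vol = frag_vol_ul * frags_per_pair           # pooled fragment volume
    total_vol = 2 * pair_vol                          # plus equal volume of buffer
    pmol_each = frag_vol_ul * frag_conc_pmol_per_ul   # amount of each fragment
    return {
        "pair_volume_ul": pair_vol,
        "total_volume_ul": total_vol,
        "pmol_per_fragment": pmol_each,
        "fragment_conc_pmol_per_ul": pmol_each / total_vol,
        "buffer_final_mM": {k: v * pair_vol / total_vol
                            for k, v in buffer_2x_mM.items()},
    }


if __name__ == "__main__":
    mix = annealing_mix()
    for key, value in mix.items():
        print(key, value)
```

With the stated values each mix is 48 µl, so pooling three annealed pairs gives the 144 µl total quoted for the fill-in reaction.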

In greater detail, the ltx gene sequence has been reported
previously (P. W. Gray et al., Nature 312:721-724 (1984)). A small
number of long synthetic fragments were annealed and formed a
partially double-stranded form. The gaps were then filled in by E.
coli DNA polymerase in vitro to generate completely double-stranded
DNA for cloning.

The synthetic ltx gene is shown in Figure 12. It has the
following features:

1. It encodes the mature ltx protein of 171 amino acids.

2. It has a total of 1074 (2 x 537) nucleotides, of which
642 (60%) were made by chemical synthesis in ten fragments; the rest
were made by enzymatic synthesis.



WO 88/07079 



PCT/US88/00374 




3. Unique restriction sites, which facilitated subsequent 
construction for cloning and expression, were introduced at the 
termini and the internal regions. 
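
The figures quoted in the feature list can be checked with simple arithmetic. The sketch below is illustrative only: the constant names are hypothetical, and the source does not say how the non-coding remainder is divided between restriction sites and start or stop codons, so it is reported only as a total.

```python
# Arithmetic check of the synthetic gene figures quoted in the feature list.

MATURE_RESIDUES = 171             # amino acids in the mature protein
DUPLEX_LENGTH_BP = 537            # length of the double-stranded gene
CHEMICALLY_SYNTHESIZED_NT = 642   # nucleotides made by chemical synthesis
N_FRAGMENTS = 10                  # number of synthetic fragments

total_nt = 2 * DUPLEX_LENGTH_BP                        # both strands
enzymatic_nt = total_nt - CHEMICALLY_SYNTHESIZED_NT    # filled in enzymatically
chemical_fraction = CHEMICALLY_SYNTHESIZED_NT / total_nt
mean_fragment_nt = CHEMICALLY_SYNTHESIZED_NT / N_FRAGMENTS

# Codons for the mature protein alone; whatever is left over must be
# flanking sequence (assumption: restriction sites, start/stop codons).
coding_bp = 3 * MATURE_RESIDUES
flanking_bp = DUPLEX_LENGTH_BP - coding_bp

print(f"total nucleotides: {total_nt}")
print(f"chemical synthesis: {chemical_fraction:.1%}, mean fragment {mean_fragment_nt:.1f} nt")
print(f"enzymatic synthesis: {enzymatic_nt} nt")
print(f"coding: {coding_bp} bp, flanking: {flanking_bp} bp")
```

The ten fragments therefore average about 64 nucleotides each, which fits the description of "a small number of long synthetic fragments".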

B. Assembly of the Synthetic Fragments 

The strategy was to anneal the fragments in vitro to form
full-length (or greater size) molecules with gaps, and then to fill in
the gapped regions with DNA polymerase, and to seal the joints with
DNA ligase. The fragments were first annealed in pairs so that each
duplex contained 5'-protruding ends to allow DNA polymerase-directed
synthesis. The five pairs consisted of (a) fragments 1+2, (b) 3+4,
(c) 5+6, (d) 7+8, and (e) 9+10.

Three pairs of the annealed fragments, (c), (d), and (e), were
then combined (total of 144 µl), and 30 µl of 2.5 mM dXTP, 3 µl of 10
mM ATP, 2 µl of DNA polymerase Klenow enzyme, and 2 µl of T4 DNA ligase
were added. The reaction was carried out at 4°C for 20 minutes
followed by incubation at 20°C for 60 minutes. The reaction was then
stopped and the DNA extracted. A fully double-stranded DNA was
synthesized. The DNA fragments were digested with HindIII and BamHI,
purified from gel, and cloned into the corresponding sites of pUC13
plasmid vector. From plasmid screening of 32 candidates, 24 correct
clones were identified. Plasmid pSYC1378 was a representative of this
group.
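
The reaction composition above can be summarized numerically. The following sketch is a rough calculation, not part of the original protocol: it assumes the enzyme additions contribute only volume, and it treats "2.5 mM dXTP" as the concentration of the nucleotide stock without deciding whether that figure applies to each nucleotide or to the mixture.

```python
# Final concentrations in the fill-in/ligation reaction and cloning yield.

ANNEALED_PAIRS_UL = 144.0     # three annealed pairs combined
DXTP_UL, DXTP_STOCK_MM = 30.0, 2.5
ATP_UL, ATP_STOCK_MM = 3.0, 10.0
KLENOW_UL = 2.0
LIGASE_UL = 2.0

total_ul = ANNEALED_PAIRS_UL + DXTP_UL + ATP_UL + KLENOW_UL + LIGASE_UL

# Simple dilution: stock concentration times volume added over total volume.
dxtp_final_mm = DXTP_STOCK_MM * DXTP_UL / total_ul
atp_final_mm = ATP_STOCK_MM * ATP_UL / total_ul

# Screening outcome reported in the text.
CANDIDATES_SCREENED = 32
CORRECT_CLONES = 24
correct_fraction = CORRECT_CLONES / CANDIDATES_SCREENED

print(f"reaction volume: {total_ul} ul")
print(f"dXTP: {dxtp_final_mm:.2f} mM, ATP: {atp_final_mm:.2f} mM")
print(f"correct clones: {correct_fraction:.0%}")
```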

A similar approach to assemble pairs (a), (b), and (c) into
a single long fragment was unsuccessful. However, the double-stranded
DNA that was produced was digested with HindIII and the resulting
fragments were cloned into a HindIII/SmaI doubly digested pUC13 vector
DNA. Although none of the candidates screened had a completely correct
structure, clones with partially correct structure by
restriction endonuclease mapping were identified. Among the
double-stranded fragments were found those that carry the correct sequence in
the regions covering the EcoRI-NsiI, the NsiI-KpnI, and the
KpnI-HindIII sequences. These three fragments were cloned along with the
HindIII-BamHI fragment from pSYC1378 into the EcoRI-HindIII sites of
pUC18. The derivative that contained the ltx sequence was designated
pSYC1408.

C. Expression in Streptomyces lividans of the ltx gene

Two plasmids were constructed to express the ltx gene. The
amy-ltx fusion plasmid consists of four parts generated by ligation of
the following fragments:

1. the ltx fragment - The ltx gene in pSYC1408 was excised
with HinPI (an isoschizomer of HhaI) and BamHI (the HinPI-generated
termini can be ligated to NarI-generated fragments),

2. the B. thuringiensis retroregulator (the cry terminator)
flanked by BamHI and EcoRI from plasmid pLWlt-10. The cry terminator
may also be obtained in the same BamHI-EcoRI fragment from pHCW701,
ATCC #39757.

3. the ermE-amy fusion fragment (ermE promoter and amylase
signal peptide coding sequence), described above, flanked by SstI-NarI
from pSYC1311.

4. the large pUC13-derived fragment generated by SstI and
EcoRI.

The resulting plasmid was designated pSYC1443. The
ermE-amy-ltx-cry fusion was excised from pSYC1443 with EcoRI. The termini
were repaired with Klenow fragment and the fragment was then digested
with SstI. It was ligated into the Streptomyces plasmid vector pIJ702
under blunt end conditions between the PstI site, which had been trimmed
with S1 nuclease to form a flush end, and the SstI site. The resulting
plasmid, pSYC1481, was introduced into Streptomyces lividans strain
TK24 by protoplast transformation for expression studies.

Similarly, the ltx gene was fused to the signal sequence of
ORF438. The modified ORF438 sequence with a NarI site introduced at
the end of the signal peptide coding sequence as described above was
obtained from pSYC1292 by Hind i and XbaI digestion. The ermE
promoter fragment with the added SphI site was excised from pSYC1303
with SphI; the termini were then trimmed with the exonuclease activity
of DNA polymerase and digested with EcoRI. These two fragments were
then ligated into the EcoRI-XbaI region of pUC18. The resulting
plasmid, pSYC1449 (and also pSYC1448), was employed for expression
vector construction.

The 0.4-kb SstI-NarI fragment containing the ermE-ORF fusion
(ermE promoter and the signal peptide coding sequence of ORF438) was
purified after partial digestion with NarI and cleavage with SstI of
the pSYC1449 DNA. It was ligated with ltx from pSYC1408, the cry gene
retroregulator from pLWlt-10, and the pUC13 plasmid fragments, as
outlined above for the construction of pSYC1443. The resulting
plasmid, pSYC1450, contained the promoter of ermE followed by
sequences derived from ORF, ltx and the cry retroregulator. This region
was then excised with EcoRI; the termini were repaired with Klenow
fragment and then digested with SstI, and cloned into the PstI site,
which was trimmed with S1 nuclease, and SstI site of the Streptomyces
plasmid pIJ702 to generate plasmids pSYC1483 and pSYC1481. Plasmid
pSYC1483 has been deposited in applicants' culture collection as CMCC
#3031 and was deposited in the ATCC on March 4, 1987.

Streptomyces lividans strain TK24 transformed with either
pSYC1481 or pSYC1483 was grown in shake flasks for three days in R6
medium and the extracellular lymphotoxin activity was determined using
the assay mentioned above for TNF activity. TK24 (pSYC1481) produced
about 500 U/ml, whereas TK24 (pSYC1483) produced 4 x 10^3 U/ml. The
level of TK24 (pSYC1483) reached 4 x 10^4 U/ml on day 5 and 1.2 x 10^5
U/ml on day 7.
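
The relative yields reported above can be expressed as fold changes. This sketch simply divides the reported activities; it assumes the printed values are powers of ten (4 x 10^3, 4 x 10^4) and reads the day-7 value as 1.2 x 10^5 U/ml, which is an interpretation of the printed figure rather than a confirmed number.

```python
# Fold changes in extracellular lymphotoxin activity (U/ml).
# Assumption: the day-7 value for pSYC1483 is read as 1.2 x 10^5 U/ml.

activity_day3 = {"pSYC1481": 500.0, "pSYC1483": 4.0e3}
time_course_1483 = {3: 4.0e3, 5: 4.0e4, 7: 1.2e5}   # day -> U/ml

# Effect of the construct (second plasmid relative to the first) at day 3.
construct_fold = activity_day3["pSYC1483"] / activity_day3["pSYC1481"]

# Increase between consecutive sampling days for pSYC1483.
days = sorted(time_course_1483)
step_folds = {
    (d0, d1): time_course_1483[d1] / time_course_1483[d0]
    for d0, d1 in zip(days, days[1:])
}
overall_fold = time_course_1483[days[-1]] / time_course_1483[days[0]]

print(f"pSYC1483 vs pSYC1481 at day 3: {construct_fold:.0f}-fold")
for (d0, d1), fold in step_folds.items():
    print(f"day {d0} -> day {d1}: {fold:.0f}-fold")
print(f"day {days[0]} -> day {days[-1]}: {overall_fold:.0f}-fold")
```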

The extracellular proteins from TK24 (pSYC1483) were
analyzed by SDS-PAGE as shown in Figure 13. The apparent 16-kd
protein is slightly smaller than the expected size for mature ltx of
171 residues. The intensity of this putative ltx protein band is
consistent with the activity assay data, and it represents about 2-4
µg/ml of lymphotoxin in the medium on day 7.
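
Two rough consistency checks follow from the numbers in this paragraph. The sketch below is an estimate only: the average residue mass of 110 Da is a common rule of thumb and is not taken from the source, the band estimate is read as 2-4 micrograms per ml, and the day-7 activity is read as 1.2 x 10^5 U/ml.

```python
# Expected size of a 171-residue protein and the specific activity implied
# by the gel estimate. All constants below are labeled assumptions.

RESIDUES = 171
AVERAGE_RESIDUE_MASS_DA = 110.0   # rule-of-thumb value, not from the source
APPARENT_MASS_KD = 16.0           # band position on SDS-PAGE

expected_mass_kd = RESIDUES * AVERAGE_RESIDUE_MASS_DA / 1000.0
size_difference_kd = expected_mass_kd - APPARENT_MASS_KD

DAY7_ACTIVITY_U_PER_ML = 1.2e5            # assumed reading of the text
BAND_ESTIMATE_UG_PER_ML = (2.0, 4.0)      # range given for day 7

# U/ml divided by ug/ml gives U/ug; multiply by 1000 to get U/mg.
implied_specific_activity = tuple(
    DAY7_ACTIVITY_U_PER_ML / ug * 1000.0 for ug in BAND_ESTIMATE_UG_PER_ML
)

print(f"expected mass: about {expected_mass_kd:.1f} kd "
      f"({size_difference_kd:.1f} kd above the apparent size)")
print("implied specific activity: "
      f"{min(implied_specific_activity):.1e} to {max(implied_specific_activity):.1e} U/mg")
```

Under these assumptions the implied specific activity falls in the range of a few times 10^7 U/mg.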

Example 3

Modification of promoter-signal sequence construction for increased
secreted expression

The chimeric amy signal sequence-TNF gene under the control
of the aph or the ermE promoters expressed TNF at relatively lower
levels in Streptomyces lividans in the example above. A similar
construction using the ORF438 signal peptide fragment that was used in
the expression of lymphotoxin was made.

The construction of the ORF signal peptide-TNF fusion gene is
shown in Figure 14. In plasmid pSYC1449, the 0.28-kb ermE promoter
fragment is located upstream of the 0.12-kb fragment that contains the
ORF438 ribosome binding site and signal peptide coding sequences. By
a four-fragment ligation procedure, the ermE promoter, the ORF438
ribosome binding site, the ORF438 signal peptide coding sequence, the
TNF coding sequence, and the B.t. retroregulator were linked to a
pUC13 vector. pSYC1449 was partially digested with NarI, and digested
to completion with SstI. Plasmids pUC13, M13pAW711 and pAW711D were
digested as described above in connection with the production of
plasmid pSYC1414. The required fragments could also be obtained from
pSYC1414. The large EcoRI-SstI pUC13 fragment, the 385-bp HpaII-BstEII
fragment of M13pAW711, and the 490-bp BstEII-EcoRI fragment of pAW711D were
combined with and ligated to the NarI-SstI fragment of pSYC1449 under
sticky end conditions. A plasmid having the fragments in proper order
as determined by restriction endonuclease digestion and mapping was
designated pSYC1477.

The ermE-ORF-TNF fusion gene from pSYC1477 was excised by
EcoRI, followed by repair with Klenow fragment to generate flush ends
and digested with SstI. The SstI-blunt fragment was inserted into the
Streptomyces vector pIJ702 between the PstI (blunt-ended) and the
EcoRI sites. The plasmid generated by this procedure was designated
pSYC1493. pSYC1493 has been deposited in the applicants' culture
collection as CMCC #3032 and was deposited in the ATCC on March 4,
1987.

A modified ermE promoter, ermE1a, was also used in the
analogous construction. This promoter differs from its parent by the
deletion of the TGG triplet near the "-35" region of the ermE promoter
(see Figure 1).

Figure 15 outlines the construction of the plasmid pSYC1504,
which is similar to pSYC1493 except that the ermE promoter in pSYC1493
was replaced by the ermE1a promoter carried on plasmid pIJ4065. The
modification of the ermE promoter to produce the ermE1a promoter can also
be carried out using site-specific mutagenesis of the small BanH I to
BamH I fragment of pSYC1472 to yield the same result.

S. lividans TK24 transformants carrying pSYC1493 or pSYC1504
were grown in shake flasks in R6 (tryptone soy broth) medium for 3
days. TNF proteins in the culture supernatants and cell pellets were
then analyzed.

Table II shows the TNF activity assay results of various TK24
transformants. The TK24 (pSYC1493) strain produced about 7-fold more
TNF than TK24 (pSYC1414). These two strains differ in two aspects:
the ribosome binding site and the signal peptide coding sequences.
The ORF signal peptide, together with its ribosome binding site, is a
better combination than the cognate sequences from the amy (Sli) gene.

Strain TK24 (pSYC1504) showed a further 2-fold increase in
TNF expression over TK24 (pSYC1493).

TABLE II

Secreted TNF from TK24 transformants

Plasmid    Promoter   S/D +       Secreted TNF         Junction Sequence
                      Sig. Pep.   U/ml (µg/ml)*        (Signal peptide-TNF)

pSYC1412   aph        amy         1.3 x 10^4           ...PAAA-AVRSSS...
pSYC1414   ermE       amy         4.0 x 10^4           ...PAAA-AVRSSS...
pSYC1493   ermE       ORF438      2.7 x 10^5 (~10)     ...PAARA-VRSSS...
pSYC1504   ermE1a     ORF438      5.5 x 10^5 (~20)     ...PAARA-VRSS...

*Specific activity = 2-3 x 10^7 U/mg
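
The fold differences quoted in the text and the approximate mass yields in Table II can be reproduced from the activity values and the stated specific activity. The sketch below is a plain recalculation under the assumption that the parenthesized table entries are approximate micrograms per ml; the helper name `ug_per_ml_range` is hypothetical.

```python
# Recalculate fold changes and mass yields from the Table II activities.

activity_u_per_ml = {
    "pSYC1412": 1.3e4,
    "pSYC1414": 4.0e4,
    "pSYC1493": 2.7e5,
    "pSYC1504": 5.5e5,
}
SPECIFIC_ACTIVITY_U_PER_MG = (2.0e7, 3.0e7)   # range from the table footnote


def ug_per_ml_range(u_per_ml):
    """Convert U/ml to a (low, high) range in ug/ml using the footnote range."""
    low_sa, high_sa = SPECIFIC_ACTIVITY_U_PER_MG
    # Higher specific activity means less protein for the same activity.
    return u_per_ml / high_sa * 1000.0, u_per_ml / low_sa * 1000.0


# Signal peptide + ribosome binding site effect (same ermE promoter).
fold_signal = activity_u_per_ml["pSYC1493"] / activity_u_per_ml["pSYC1414"]
# Promoter effect (same ORF438 signal peptide).
fold_promoter = activity_u_per_ml["pSYC1504"] / activity_u_per_ml["pSYC1493"]

print(f"pSYC1493 vs pSYC1414: {fold_signal:.2f}-fold")
print(f"pSYC1504 vs pSYC1493: {fold_promoter:.2f}-fold")
for plasmid in ("pSYC1493", "pSYC1504"):
    low, high = ug_per_ml_range(activity_u_per_ml[plasmid])
    print(f"{plasmid}: {low:.1f} to {high:.1f} ug/ml")
```

The computed ratios of about 6.75 and 2.04 correspond to the "about 7-fold" and "2-fold" statements, and the converted ranges bracket the approximate values of 10 and 20 shown in the table.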

The total cellular proteins and the secreted proteins from
these strains were analyzed. As shown in Figure 16, which shows a 12.5%
SDS-PAGE preparation of culture supernatants, TNF is the most abundant
protein species in the three-day culture medium supernatants from the
two TNF-producing strains. From the stained gel, it is estimated that
TK24 (pSYC1504) produced about 0.5 µg of TNF in 25 µl of culture,
equivalent to 20 mg/l. This estimate matches the activity assay
result shown in Table II.
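
The agreement between the gel estimate and the activity assay can be checked by converting the gel loading to a concentration and predicting the activity it implies. This is a minimal sketch; it assumes the gel amounts are in micrograms and microliters and uses the 2-3 x 10^7 U/mg specific activity and the 5.5 x 10^5 U/ml value from Table II.

```python
# Convert the gel estimate to mg/l and compare with the measured activity.

TNF_ON_GEL_UG = 0.5
CULTURE_LOADED_UL = 25.0

# ug per ul, scaled to ug per ml; numerically equal to mg per liter.
gel_estimate_ug_per_ml = TNF_ON_GEL_UG / CULTURE_LOADED_UL * 1000.0
gel_estimate_mg_per_l = gel_estimate_ug_per_ml

SPECIFIC_ACTIVITY_U_PER_MG = (2.0e7, 3.0e7)
MEASURED_U_PER_ML = 5.5e5     # TK24 (pSYC1504), Table II

# Activity predicted from the gel estimate at each end of the range.
predicted_u_per_ml = tuple(
    gel_estimate_ug_per_ml / 1000.0 * sa for sa in SPECIFIC_ACTIVITY_U_PER_MG
)
consistent = predicted_u_per_ml[0] <= MEASURED_U_PER_ML <= predicted_u_per_ml[1]

print(f"gel estimate: {gel_estimate_mg_per_l:.0f} mg/l")
print(f"predicted activity: {predicted_u_per_ml[0]:.1e} to {predicted_u_per_ml[1]:.1e} U/ml")
print(f"measured value within predicted range: {consistent}")
```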

Western analysis of cellular proteins (Figure 17) showed
that there were three TNF-related protein species in the cell, and one in
the culture supernatant. The lowest band corresponds to the "-4" form
of TNF. The highest band likely represents the full-length "pre-TNF"
protein with the 30-residue signal peptide sequence still linked to
TNF. The middle band, located just above the "-4" TNF protein, is
believed to represent the processed TNF generated from pre-TNF by
signal peptidase cleavage.

To the extent that full-length mature TNF is represented by
the middle band, the amino-terminal four residues were removed
from this protein before it was released into the medium. The
detection of these two forms of matured TNF proteins in the cell
indicates that processed TNF molecules are initially trapped in the
cell, most likely outside the cell membrane but inside the cell wall.

We propose the following post-translational processing
events that lead to the secretion of TNF:

Pre-TNF having the recombinant signal sequence is produced
inside the cytoplasmic membrane. A signal peptidase active on the
membrane processes the pre-TNF and secretes a full-length mature TNF
associated with the cell envelope, but outside of the cytoplasmic
membrane. An aminopeptidase associated with the cell envelope further
processes the full-length mature TNF to produce the "-4" form of TNF,
which is released into the medium as secreted "-4" TNF.
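
The proposed sequence of processing events can be illustrated with a toy string model. Everything in this sketch is hypothetical: the placeholder residues ("x" and "z") stand in for sequence not given in this section, and only the 30-residue signal length, the PAARA-VRSSS junction from Table II, and the removal of four amino-terminal residues are taken from the text.

```python
# Toy model of the proposed processing steps: pre-TNF -> mature -> "-4" form.
# Placeholder residues are illustrative and are NOT the real sequences.

SIGNAL_LENGTH = 30        # "30-residue signal peptide" from the text
TRIMMED_RESIDUES = 4      # amino-terminal residues removed in the "-4" form

# Signal peptide: placeholder body ending in the junction residues PAARA.
signal_peptide = "M" + "x" * (SIGNAL_LENGTH - 6) + "PAARA"
# Mature protein: starts with VRSSS per the junction; the rest is a placeholder.
mature_tnf = "VRSSS" + "z" * 20

pre_tnf = signal_peptide + mature_tnf


def signal_peptidase(pre_protein, signal_length):
    """Step 1: remove the signal peptide at the cytoplasmic membrane."""
    return pre_protein[signal_length:]


def aminopeptidase(protein, n_residues):
    """Step 2: trim residues from the amino terminus in the cell envelope."""
    return protein[n_residues:]


envelope_form = signal_peptidase(pre_tnf, SIGNAL_LENGTH)      # middle band
secreted_form = aminopeptidase(envelope_form, TRIMMED_RESIDUES)  # "-4" form

for name, seq in [("pre-TNF", pre_tnf),
                  ("full-length mature", envelope_form),
                  ('"-4" form', secreted_form)]:
    print(f"{name}: {len(seq)} residues, starts with {seq[:5]}")
```

In this model the three cell-associated species correspond to the three strings, ordered from longest to shortest as in the Western analysis, and only the shortest is released.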

As will be apparent to those skilled in the art, the
secretion vectors according to the invention are useful in the
secreted production in Streptomycetes of heterologous proteins from
various sources, including mammalian proteins. The secretion vectors
with heterologous DNA other than that specifically exemplified are, of
course, considered to be within the scope of the invention.

WHAT IS CLAIMED IS:

1. A recombinant vector suitable for secreting heterologous
proteins in streptomycetes comprising a plasmid replicable in
streptomycetes, said plasmid comprising a promoter functional in
Streptomyces, and a DNA sequence encoding a secretion signal sequence
that is operable in streptomycetes and may be operably linked at a
site for insertion of a heterologous gene under the control of said
promoter.

2. The recombinant vector of claim 1 further including a
heterologous gene in operable linkage with the secretion signal
sequence.

3. The recombinant vector of claim 1 wherein the promoter
is selected from the group consisting of the aph, ermE promoter,
ermE1a and modified ermE1a.

4. The recombinant vector of claim 1 wherein the signal
sequence is a streptomycetes signal sequence.

5. The recombinant vector of claim 4 wherein the signal
sequence is selected from the group consisting of the amy signal
sequence and the ORF438 signal sequence.

6. The recombinant vector of claim 1 comprising the aph
promoter and the secretion signal sequence selected from the group
consisting of the amy secretion signal sequence, and the ORF438
secretion signal sequence.

7. The recombinant vector of claim 6 wherein the ORF438
secretion signal sequence and the aph promoter are separated by an
intervening DNA sequence comprising a Shine/Dalgarno sequence.

8. The vector of claim 1 wherein the promoter is ermE-SphI
and the secretion signal sequence is selected from the group
consisting of the amy secretion signal sequence and the ORF438
secretion signal sequence.

9. The vector of claim 1 wherein the heterologous gene
encodes a protein of a eukaryotic or prokaryotic protein.

10. A host streptomycetes cell transformed with the vector
of claims 1, 2, 3, 4, 5, 6, 7, 8 or 9.

11. The host cell of claim 15 wherein said streptomycetes
are selected from the group consisting of S. lividans, S. coelicolor,
S. griseus, S. parvulus, S. albus, S. vinaceus, S. acrimycini, S.
clavuligerus, S. rubiginosus, S. azureus, S. glaucescens, S. rimosus,
S. limosus, S. kanamyceticus, S. violaceoruber, and any streptomycetes
related host in which the replicon of pIJ702 is functional.



[Sheet 1/19: figure showing a nucleotide sequence with its amino acid translation, including a HindIII site (AAG CTT); the remaining content is not recoverable from the scanned text.]



[Sheet 2/19: figure content not recoverable from the scanned text.]



[Sheet 3/19: figure content not recoverable from the scanned text.]







FIG.-4.

   1 CCAGACCGTGCAGCGCACCGAGTTCGTCTTCCAGCCGGAGCTGGTGGTGCGCGGCTCCAC
      GlnThrValGlnArgThrGluPheValPheGlnProGluLeuValValArgGlySerThr

     >>».„>>> >>..<< <<<;..<<<<

  61 CGCCCAGTGGGTGCCCGGCGGCTGAGGGGATGTACGGGGCGCTTCGTCCGGAGCCGCCCG
      AlaGlnTrpValProGlyGly...

 121 CAATCTCTTGCAGAGGCTTGCGGGCCCTGCTGCCCGGCGCCCAACCCCCTTGATGTCCGT

 181 GGTGTTTCGGGTTCGTGTCCAAAGGGTTGAGCGCGGGTACCGCTCGCTCTACGGTCTGCT

 241 TCGCGACGCTCCTTGCAGTTTTGCTGCAAGAGAATTCAGCCCTCCGCCCCCCCCGATCAG

 301 GAGGCACCACATGGCCCGCAGACTCGCCACCGCGTCCCTAGCCGTGCTGGCGGCGGCCGC
               METAlaArgArgLeuAlaThrAlaSerLeuAlaValLeuAlaAlaAlaAla
               -28      -25            -20            -15

 361 CACCGCCCTCACCGCGCCCACACCCGCCGCTGCCGCCCCGCCCGGGGCGAAGGACGTCAC
      ThrAlaLeuThrAlaProThrProAlaAlaAlaAlaProProGlyAlaLysAspValThr 9
         -10            -5          -1^1

 421 CGCCGTCCTCTTCGAGTGGAAGTTCGCCTCCGTAGCCCGCGCCTGCACCGACAGCCTCGG
      AlaValLeuPheGluTrpLysPheAlaSerValAlaArgAlaCysThrAspSerLeuGly 29

 481 CCCGGCCGGCTACGGATACGTCCAGGTCTCGCCGCCCCAGGAGCACATCCAGGGCAGCCA
      ProAlaGlyTyrGlyTyrValGlnValSerProProGlnGluHisIleGlnGlySerGln 49

 541 GTGGTGGACCTCCTACCAGCCCGTCAGCTACAAGATCGCCGGACGGCTCGGCGACCGCGC
      TrpTrpThrSerTyrGlnProValSerTyrLysIleAlaGlyArgLeuGlyAspArgAla 69

 601 CGCCTTCAAGTCCATGGTCGACACCTGCCACGCGGCCGGCGTCAAGGTCGTCGCCGACTC
      AlaPheLysSerMETValAspThrCysHisAlaAlaGlyValLysValValAlaAspSer 89

FIG.-4-1.
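
The reading frame shown in the figure can be cross-checked by translating the nucleotide rows with the standard genetic code. The sketch below is illustrative: the two 60-nucleotide rows are transcribed from the scanned figure with spaces removed (so the transcription itself is an assumption), the 28-residue signal length is taken from the -28 to -1 numbering, and the function name `translate` is hypothetical.

```python
# Translate the start of the coding region in the figure and split it at the
# signal peptide cleavage site indicated by the -28 ... -1 numbering.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in the conventional TCAG order.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Two consecutive rows of the figure, beginning at nucleotide 301.
ROW_301 = "GAGGCACCACATGGCCCGCAGACTCGCCACCGCGTCCCTAGCCGTGCTGGCGGCGGCCGC"
ROW_NEXT = "CACCGCCCTCACCGCGCCCACACCCGCCGCTGCCGCCCCGCCCGGGGCGAAGGACGTCAC"
SIGNAL_LENGTH = 28   # residues numbered -28 to -1 in the figure


def translate(dna):
    """Translate complete codons of a DNA string into one-letter residues."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


sequence = ROW_301 + ROW_NEXT
start = sequence.find("ATG")            # first ATG in the two rows
protein = translate(sequence[start:])

signal_peptide = protein[:SIGNAL_LENGTH]
mature_start = protein[SIGNAL_LENGTH:]

print(f"signal peptide: {signal_peptide}")
print(f"start of mature protein: {mature_start}")
```

With these rows the signal peptide ends in PAAA and the mature protein begins with A, which resembles the ...PAAA-A... junction listed for the amy constructs in Table II.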

 661 GGTCATCAACCACATGGCCGCGGGTTCCGGCACCGGCACCGGCGGCAGCGCGT
      ValIleAsnHisMETAlaAlaGlySerGlyThrGlyThrGlyGlySerAlaT

 721 GTACGACTACCCGGGCATCTGGTCCGGCGCCGACATGGACGACTGCCGCAGCGAGATCAA
      TyrAspTyrProGlyIleTrpSerGlyAlaAspMETAspAspCysArgSerGluIleAsn 129

 781 CGACTACGGCAACCGCGCCAACGTCCAGAACTGCGAACTGGTCGGCCTCGCCGACCTCGA
      AspTyrGlyAsnArgAlaAsnValGlnAsnCysGluLeuValGlyLeuAlaAspLeuAsp 149

 841 CACCGGTGAGTCGTACGTCCGCGACCGCATCGCCGCCTACCTCAACGACCTGCTCTCGCT
      ThrGlyGluSerTyrValArgAspArgIleAlaAlaTyrLeuAsnAspLeuLeuSerLeu 169

 901 CGGTGTGGACGGCTTCCGCATCGACGCCGCCAAGCACATGCCCGCCGCCGACCTCACCGC
      GlyValAspGlyPheArgIleAspAlaAlaLysHisMETProAlaAlaAspLeuThrAla 189

 961 CATCAAGGCCAAGGTCGGCAACGGGAGCACGTACTGGAAGCAGGAGGCCATCCACGGCGC
      IleLysAlaLysValGlyAsnGlySerThrTyrTrpLysGlnGluAlaIleHisGlyAla 209

1021 GGGCGAGGCCGTCCAGCCCAGCGAGTACCTCGGCACCGGCGACGTCCAGGAGTTCCGCTA
      GlyGluAlaValGlnProSerGluTyrLeuGlyThrGlyAspValGlnGluPheArgTyr 229

1081 CGCCCGCGACCTCAAGCGGGTCTTCCAGAACGAGAACCTCGCCCACCTGAAGAACTTCGG
      AlaArgAspLeuLysArgValPheGlnAsnGluAsnLeuAlaHisLeuLysAsnPheGly 249

1141 CGAGGACTGGGGCTACATGGCGAGCGGCAAGTCCGCCGTCTTCGTCGACAACCACGACAC
      GluAspTrpGlyTyrMETAlaSerGlyLysSerAlaValPheValAspAsnHisAspThr 269

1201 CGAGCGGGGCGGCGACACCCTCAACTACAAGAACGGCTCCGCCTACACCCTCGCCGGCGT
      GluArgGlyGlyAspThrLeuAsnTyrLysAsnGlySerAlaTyrThrLeuAlaGlyVal 289

1261 CTTCATGCTGGCCTGGCCCTACGGCTCCCCGGACGTCCACTCCGGCTACGAGTTCACCGA
      PheMETLeuAlaTrpProTyrGlySerProAspValHisSerGlyTyrGluPheThrAsp 309

1321 CCACGACGCCGGCCCGCCCAACGGCGGCACCGTCAACGCCTGCTACAGCGACGGCTGGAA
      HisAspAlaGlyProProAsnGlyGlyThrValAsnAlaCysTyrSerAspGlyTrpLys 329

1381 GTGCCAGCACGCCTGGCCCGAGCTCTCCTCCATGGTCGGCCTGCGCAACACCGCCTCCGG
      CysGlnHisAlaTrpProGluLeuSerSerMETValGlyLeuArgAsnThrAlaSerGly 349

1441 GCAGCCCGTCACCAACTGGTGGGACAACGGCGGCGACCAGATCGCCTTCGGCCGCGGCGA
      GlnProValThrAsnTrpTrpAspAsnGlyGlyAspGlnIleAlaPheGlyArgGlyAsp 369

1501 CAAGGCGTACGTCGCCATCAACCACGAGGGCTCCGCGCTGAACCGCACCTTCCAGAGCGG
      LysAlaTyrValAlaIleAsnHisGluGlySerAlaLeuAsnArgThrPheGlnSerGly 389

1561 CCTGCCCGGCGGCGCCTACTGCGACGTCCAGAGCGGCAGGTCCGTCACGGTCGGCTCCGA
      LeuProGlyGlyAlaTyrCysAspValGlnSerGlyArgSerValThrValGlySerAsp 409

1621 CGGCACCTTCACCGCCACCGTCGCCGCCGGCACC^CCTGaCCTGCACACCGGGGCCCG
      GlyThrPheThrAlaThrValAlaAlaGlyThrAlaLeuAlaLeuHisThrGlyAlaArg 429

1681 [illegible]

FIG.-4-2.

1741 CAACGCCACCACCGCCTGGGQCGAGAACATCTACGTCACCGGTGACCAGGCCGCCCTCGG
      [illegible]                IleTyrValThrGlyAspGlnAlaAlaLeuGly 469

1801 CAACTGGGACCCGGCCCGCGCCCTCAAGCTCGACCCGGCCGCCTACCCGGTGTGGAAGCT
      AsnTrpAspProAlaArgAlaLeuLysLeuAspProAlaAlaTyrProValTrpLysLeu 489

1861 CGACGTGCCGCTGGCCGCCGGAACCCCCTTCCAGTACAAGTACCTGCGCAAGGACGCCGC

1921 GGGGAAGGCCGTCTGGGAGTCCGGCGCCAACCGCACGKGAC[illegible]ACCACCGGCGC
      GlyLysAlaValTrpGluSerGlyAlaAsnArgThrAlaThrValGlyThrThrGlyAla 529

     >>>>>>>>>>.>>>>• •

1981 CCTCACCCTCAACGACACCTGGCGCGGCTGACCCC&CCCCCATCGGCCCGGCGCCCGCCT
      LeuThrLeuAsnAspThrTrpArgGly... 538

2041 CCCCGCGG^CGCCGGKCGCTGCCGC

2101 CCCCGAAGGAGACCCGTCCCCGTGCGACGGACCACCATCCTCGCGGTGAGTCTCGGCCTG
2161 TGCGCCGCCCTCACCGCCACCCTCCCGGGCACCGCCGACACCCCCGACGCCCCGGCCGCG
2221 CGGGCCACCGCCGCCGAGGACACCGCCGAGGCCGTCTGGCTCGACGCCCGCACCGTCGCC

2281 TGGCCGCGCGC

FIG.-4-3.